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Codes Which Detect Deception 


By E. N. GILBERT, Mrs. F. J. MacWILLIAMS, and N. J. A. SLOANE 
(Manuscript received May 15, 1973) 


We consider a new kind of coding problem, which has applications in a 
variety of situations. A message x is to be encoded using a key m to form 
an encrypted message y = ®(x, m), which is then supplied to a user G. 
G knows m and so can calculate x. It is desired to choose ®(-,-) so as to 
protect G against B, who knows x, y, and ®(-,-) (but not m); B may 
substitute a false message y’ for y. It is shown that af the key can take K 
values, then an optimal strategy for B secures him a probability of an 
undetected substitution = K-*. Several encoding functions ®(-,-) are 
given, some of which achieve this bound. 


1. INTRODUCTION 


The gambling casino has often supplied a vivid and concrete setting 
for problems in probability theory,! stochastic processes,? hypothesis 
testing,* information theory,* and coding theory,’ and we shall use it 
to describe our problem. 

There are two main participants, the owner of the casino G (stand- 
ing for good guy) and the manager B (the bad guy). B has been re- 
porting the daily takings from the slot machines to be less than they 
actually are and keeping the difference for himself. To prevent this, 
G proposes to install in each slot machine a key generator of which he 
possesses an exact duplicate and an encoder which will encrypt the 

405 


day’s takings x using a key m to produce an encrypted message 
y = &(z, m). (1) 


(See Figs. 1 and 2.) The device will punch y onto a paper tape. At 
suitable intervals B will mail the tape to G, who will calculate x from 
y and m. From time to time G will visit the casino to change the key 
generator. We assume that B knows z and ®(-,-) (but cannot change 
them), and y (which he can change), but does not know m. G knows 
y, m, and &(-,-). 

If B attempts to give G a false message y,, there may be no 2’ 
satisfying y, = ®(z’, m), and then G will discover B’s deception. But 
if B can solve (1) for m, then he can successfully substitute a false 
message v’ by giving G the correctly encrypted message y’ = ®(x’, m). 
The problem is to design ®(-,-) so as to make it as difficult as possible 
for B to deceive G without being caught. 

Clearly, the problem is applicable to other situations (vending 
machines, cash registers, etc.) and in fact was first presented to us by 
G. J. Simmons of Sandia Corporation in connection with monitoring 
the production of certain materials in the interests of arms limitation. 

The problem resembles the one normally encountered in cryptog- 
raphy in that a key m is used to encrypt a clear text x into an encoded 
form y = ®(z, m). But there is an important difference. Since B knows 
x already, many of the standard cryptographic codes would allow B 
to recover the key m. 

To prevent B from using (1) to learn the key, G must construct 
@(-,-) so that (1) has several solutions m. Then B will probably pick 
a wrong key m, and G will discover that B’s encrypted mesage y, is 
incompatible with the correct key. As one might expect, to provide 
many solutions to (1) G must use a large number K of possible keys. 


FALSE MESSAGE x’ 








Y 
GENERATOR 


~A 
m 





KE 
GENERATOR 
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DECODER 


x,x’, OR WARNING 


MESSAGE x 


Fig. 1—Encoding to detect substitution. 
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MESSAGE KEY ENCRYPTED 
MESSAGE 





Fig. 2—Diagram of a code. 


When B tries to substitute a false message, his probability of escap- 
ing detection will be called p,. The probability p, for an optimal B 
strategy will be called p3. We will show that p; = K—*. Although 
Section IV will construct a code which is best-possible in the sense of 
achieving p, = K-}, this equality can be achieved only by severely 
restricting the number N of possible messages x. More useful codes 
must compromise among three conflicting goals for G: small p;, small 
K, and large N. We give two such codes, one random (Section VII) 
and one systematic (Section VIII). 

Throughout most of this paper we imagine that B has a particular, 
but unknown, false message x’ to substitute for x. We assume that x 
is equally likely to be any one of the N possibilities and that B picks 
x’ at random from the remaining N — 1 messages. Then p, is an average 
of the probabilities p(x, y, x’) of success when B substitutes a given 
x’ for given x, knowing y. 
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In Section IX, B uses a different strategy. There B is content to 
succeed in any deception. Given x and y, B now picks x’ to maximize 
the chance of escaping detection. Merely keeping p. small does not 
protect G against this if individual terms p,(z, y, x’) are large. With 
proper design, the systematic code of Section VIII still defeats B. 


Il. THE AUTHENTICATOR 


A convenient special form for the encryption (1) is 
y = (x; 2), (2) 


ie., y is the clear text x followed by a string z of extra digits or letters. 
Here z is some function of x and m. G will use z to test the received 
message y for authenticity. For this reason z will be called an 
authenticator. 

Although (2) is a special case of (1), nothing is lost by restricting the 
encryption to this special form. Indeed, if some other ®,(x, m) in (1) 
provides a good code, one can always create a code of the form (2) 
by taking z = ®,(z, y), ie., 


y = &(2, m) = [a; (x, m)]. 


Including z as part of y cannot help B; he knows z already. Giving x 
to G explicitly cannot hinder him in detecting a deception by B. Thus 
the new code is at least as good for G as the old one. 

Whether or not to use a code of the form (2) is purely a matter of 
convenience. However, the form (2) has a special property which we 
can now require without loss for all codes. It is that different clear 
text messages x1, %2 cannot be encoded into the same y, i.e., 


(m1, 21) % B(me, Lo) (3) 


holds for all mi, mez if 41 ¥ x2. Then a typical code has a diagram like 
Fig. 2 which portrays clear messages x as points in the left column and 
encrypted messages y as points in the right column. The lines directed 
from left to right are labeled by the key names 1, ---, K to show how 
these keys encode each zx into a y. Because of (3) the encrypted mes- 
sages y fall into disjoint clusters, each cluster containing all possible 
images of a particular z. 


lll. PROBABILITY OF DECEPTION 
B successfully deceives G with probability p, 2 K—! just by guessing 


a key m, at random with all K keys equally likely. Better strategies use 
B’s knowledge of x and y to restrict his guess to keys satisfying (1). 
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Usually B need not guess m, = m, the correct key. B still succeeds if 
&(x’, mo) = B(x’, m). (4) 


In Fig. 2, B would pick m, to be one of 2, 3, 4, 5, or 6; if m = 2 then 
the guesses m, = 2, 4, or 6 all succeed. 

An important qualitative feature of a code is the size of the bundle 
of lines leading from the message x to the encrypted message y in the 
code diagram (Fig. 2). G must make these bundles large enough to 
prevent B from guessing m with high probability. But if the bundles 
are too large, B will succeed often because many keys m, satisfy (4). 
In compromising between the two extreme bundle sizes, G cannot limit 
B to a probability p, = 1/K. In fact, we now show that B can always 
use a strategy which succeeds with probability 


Do = K-t. (5) 


In order to prove (5) we will have to place some natural restric- 
tions on the behavior of G and B. 


(a) B does not attempt to deceive G by replacing x by xz’ = x. If 
we allowed B that kind of ‘‘deception,”’ B could succeed with 
probability p, = 1 and (5) would be a weak result. 

(b) All NV messages 2 are equally likely. Although this requirement 
could be relaxed, some condition like it must be imposed to 
forbid G from using one particular message 2; almost exclu- 
sively. In that case G could let all keys encrypt x1 to the same 
y1 but give all other messages x’ K distinct encrypted forms. 
B would then have p, < K-? but G would receive little 
information from each message. 

(c) Another restriction on G might be that he use the K keys at 
random, equally likely and independent of x. We won’t need 
this restriction on G to prove (5). If G uses the keys in any 
other way he only helps B increase po. 

(d) We will prove that (5) holds even if B picks x’ at random from 
the N — 1 messages different from z, all equally likely. This 
only strengthens (5) because there may be better strategies 
for B. 


Knowing how the message zx, x’, and key m are distributed, we can 
compute the joint probability P(2, y, x’). This probability is the weight 
used in averaging p.(z, y, x’) to get 


Po= 2 Py, X)polz, y, x’), (6) 


zyy,x! 
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as mentioned in Section I. The probability p,(z, y, x’), that B succeeds 
in substituting z’, knowing x and y, depends on how B uses g, y, x’ 
to determine a false encrypted message y,. B knows the function 
@(-,-) and the key distribution. From these, he can compute the 
conditional probability distribution P(y’|z, y, x’) of the correctly en- 
crypted false message y’ = &(x’, m). B maximizes his chance of suc- 
cess by using a false message y, which maximizes P(y’|zx, y, x’). Then 
B achieves 


Pola, y, v’) = Max Ply’ |z, y, x’) (7) 
y! 


and maximizes p, in (6). Since (7) is optimal for B we give the corre- 
sponding p, value a special name 7,. 

As a preliminary to (5) we now relate p, to the average uncertainty 
U which B has about the correctly encrypted false message y’. U is a 
conditional entropy 


U = Hy’ |2, y, 2’) 
= 2 ; Pe, y, x’, y') log P(y’|2, Y) 3 (8) 
Ty,Y,r vy 
Lemma: If B chooses y, to make (7) hold, then 
Po = Do 2 24, (9) 


Equality holds in (9) if and only af all the possible encrypted messages y’ 
for each (x, y, x’) having P(x, y, x) ¥ 0 are equally likely and there are 
exactly 2¥ such y’. 

The proof does not require restrictions (a), (b), (c), or (d). Use (7) 
to write P(y’|z, y, x’) S po(x, y, x’) in (8). Sum on y’ and use the 
convexity of the function —log p to get 

UZ— 2» P(x, y, x’) log p.(x, y, 2’) 


x,y,z! 


IV 


= log 25 , P(a, Y; x’) po(x, Y; x’). 
Z,Y,x 


Now (9) follows from (6). 

The derivation used two inequalities. Both must become equalities 
if equality holds in (9). P(y’|z, y, 2’) = p.(x, y, x’) requires all possible 
y’ to be equally likely for given z, y, 2’. In the convexity argument, 
equality requires all —log p,(z, y, x’) terms to be equal to U. 

We now bound p; in terms of the uncertainty H(m) associated with 
the choice of key. 
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Theorem 1: Suppose (7) and restrictions (a), (b), (d) all hold. Then 
Po = Po = 2-H (m), (10) 


First note that y’ is determined by y’ = &(m, 2’) if m, x’ are known. 
Then y’ contains less information than (m, x’): 


U = Aty'|a, y, «') S H(m, 2'|z, y, 2’) = H(m|a,y, 2). (11) 


_ But the conditional probability for m given z, y, x’ depends only on 
x, y, so (11) becomes 
U < H(m|z, y). (12) 
Also, 
H(m) 2 H(m|x) = H(m, y|x) = H(y|z) + H(m|a, y) 


so (12) provides 
U = H(m) — H(y{z). (13) 
But 
U=Hy'|z, y, 2’) S A(y’|z’). 
Because of constraint (d), x’ is equally likely to be any one of the 


N messages. Then, by (b), x and 2’ have the same distribution, 
H(y'|z') = H(y|zx), and finally 


U Ss Hly|2). (14) 


Now compare (13) and (14). If H(y|x) S$ 4H(m), then U S 34H (m) 
follows from (14). If H(y|x) = 4H(m), then U Ss 4$H(m) follows from 
(13). In either case, (10) follows from the lemma. 


Remark: The bound (10) implies (5), and in fact reduces to (5) when 
restriction (c) holds. 


IV. PROJECTIVE PLANE CODES 


Since p, is the largest probability of success obtainable by B, a 
code for which equality holds in (10) guarantees G the minimum p, 
against optimal behavior by B. This section designs such a code. We 
now assume that G behaves according to (c) of Section III, for that 
will make 

po = K-}. 


If equality is to hold in (10), all the inequalities used in proving 
Theorem 1 must become equalities. We now review these inequalities 
to obtain requirements on the code. 
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The requirements are most easily stated in terms of the bundles of 
keys in the code diagram, Fig. 2. 


(t) Every pair of bundles, from 2; to yi and 22 to y2, with a. ~ 2, 
have exactly one key in common. 
(it) Every bundle contains K} keys. 
(iat) There are K? bundles at each z. 


To prove (2), (22), (cz), begin with (11) and write H(y’|x, y, 2’) 
= H(m, x'|x, y, x’). If, for some a, y, x’, more than one key m satisfied 
y’ = &(m, zx’) then there would be more conditional uncertainty about 
the pair (m, x’) than about y’. Thus equality in (11) requires 


(i) Every pair of bundles, from 2 to y; and 22 to ye, 2 ¥ X, 
have at most one key in common. 


Equality in (9) requires that the keys in any bundle from z to y 
be distributed equally over 24 = 234(™ = K? images y’ of any 2’. 
Each of these keys leads from 2’ to a different y’ [by (7’)]. Then the 
bundle x to y has K} keys, which proves (77). Now (277) follows from 
(it) because there are only K keys. Requirements (77) and (777) also 
guarantee H(y|x) = 4log K = 4H(m), which is needed for equality 
in (18) and (14). 

To strengthen (z’) to (z) consider the K? bundles leaving x and the 
K} bundles leaving x’. There are K}-K? = K pairs of bundles. (2’) 
permits each pair to have at most one key in common. But each key 
is common to some pair. Since there are K keys, (¢) must hold. 

One can find trivial codes which satisfy (2), (2), (ii) but which 
have only a few messages x. For instance, the K keys might be arranged 
in a K? X K} square matrix and each row (or column) be designated 
as the bundle for a distinct encrypted form of x1 (or v2). Since this code 
has N = 2 it is not very useful. In order to force N to be large we need 
another requirement. 

Since (2) requires a pair (m1, m2) of different keys to belong to at 
most one bundle, the number of pairs of keys having a common bundle 
is N (E'). This number must be no greater than the unrestricted number 
of pairs of keys (#), so that 


INKY(K} — 1) $ $K(K — 1) 
N s K?+1. (15) 
The condition for equality in (15) is 


(iv) Every pair (m1, mz) of different keys belongs to exactly one 
common bundle. 
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We now add requirement (iv) in order to have a code with the largest 
possible N. Note that even for this code (15) indicates only about 
half as many message bits as key bits. 

A code satisfying (2), (zz), (a7), (iv) can be constructed from any 
finite projective plane. Recall that a projective plane is a set of points 
and lines in which: 


(v) Each pair of different lines has a unique point in common, and 
(vt) Each pair of different points belongs to a unique line. 


The most easily visualized projective plane is an infinite one based on 
the surface of a sphere. The lines and points of this projective plane 
are the great circles and pairs of diametrically opposite points on the 
sphere. A well-known technique (see Refs. 6, 7) uses a Galois field 
GF (q), where q is a prime power, to construct a projective plane having 
gq +q-+1 points and gq? + q + 1 lines. 

The code will be obtained by using certain points and lines of a pro- 
jective plane as the names of messages, keys, and bundles. First 
pick any line S to serve a special role. Using the sphere as a model, 
we call S the equator. Points on the equator will represent messages ~. 
Points not on the equator will represent keys m. Lines other than the 
equator represent encrypted messages y (bundles). Each x and m 
determines a unique line (not S because it contains m) which we use 
as the name of y in (1). 

Figure 3 shows the projective plane constructed from GF(2). It 
has 2? + 2+ 1 = 7 points. Six of the seven lines are shown as straight 
lines and the seventh, which we may take as the equator S, is a circle. 





Fig. 3—A projective plane. 
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The three points on S are the messages and the remaining four are 
keys. The six straight lines are bundles, containing two keys each. 
One can easily verify (7) and (iv) using (v) and (vz). Moreover, in 
the projective plane based on GF(q), g + 1 lines pass through each 
point and g + 1 points lie on each line. Each line different from S con- 
tains one message x and q keys. 
Each x lies on S and on q other lines. Then (72) and (iz) hold if 


q = Kt. (16) 


The equator contains N = 1+ q = 1+ K? points, as we expect from 
(15), and (iv) holds. When G uses this code, B will know that m is 
one of g keys on the line y. For any x’ ¥ z, these keys lie on q different 
lines through x’ and B has p; = 1/q = K-}. 

A Galois field GF(q) exists if and only if qg is a power of a prime, 
q = p”. Then (16) requires K to be an even power of a prime: K = p?” 
in this design. 


V. IMPLEMENTATION 


This section simplifies the code of Section IV into a form that is 
easily realized by a logic circuit. 

The usual construction for a projective plane begins by defining the 
points as vectors, having three components taken from GF(q). Two 
vectors v1, Ve are regarded as two names for the same point if they 
differ only by a scalar multiple, i.e., if ve = avi for some a € GF(q). 
The zero vector (0, 0, 0) is not used as a point. Lines are sets of points 
satisfying a linear homogeneous constraint. A line LZ can then be 
described by a nonzero vector L = (a, b, c) with the understanding that 
the points on L are the vectors v = (r, s, t) satisfying 


L-v=ar+bs+c = 0. 


Take the equator to be the line specified by the vector S = (0, 0, 1). 
Then messages x are points having third coordinate zero. By applying 
appropriate scalar multipliers, each x can be written cither as (0, 1, 0) 
or as (1, s, 0) with s © GF(q). The remaining points, which can be 
written in the standard form (7, 7, 1), are the q? keys. 

To make the logic circuit as simple as possible we agree not to use 
(0, 1, 0) as a message. There remain N = q = K? messages, all of the 
form (1, s, 0). The q lines through (1, s, 0) all have vectors (—s, 1, ¢) 
where 

st—j=e (17) 


holds for all keys (2, 7, 1) on the line. 
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Only a single element s of GF(q) need be transmitted to specify 
the vector (1, s, 0) and hence z. Likewise, the key input in Fig. 1 
requires only the pair (i, 7). The encrypted message y [a line with 
vector (—s, 1, c)] can be transmitted just as a pair (s, c). That amounts 
to using c as an authenticator z. The encoder is a computer which uses 
(17) to produce the authenticator value c from the inputs s, 7, 7. G 
uses a similar computer to test that his received s, c and known i, j 
satisfy (17). 

For example, the code obtained from the projective plane of Fig. 3 
is: 


message key encrypted message 
0 00 or 01 00 
10 or 11 01 
1 00 or 11 10 
01 or 10 11 


Again the code obtained from the projective plane with 13 points 
based on GF(3) = {0, 1, 2} is: 


message key encrypted message 
0 00, 01, 02 00 
10, 11, 12 01 
20, 21, 22 02 
1 00, 12, 21 10 
01, 10, 22 i 
02, 11, 20 12 
2 00, 11, 22 20 
02, 10, 21 21 
01, 12, 20 22 


Tables for constructing larger Galois fields will be found in Refs. 8, 
9, 10, and circuits for doing arithmetic in these fields in Refs. 10, 11, 
12. A field GF'(2') is convenient if the message originates in binary 
form. Then 2 and z each consist of b binary digits while 2b digits (b 
for 2 and b for 7) are required for the key. 


VI. BLOCK DESIGNS 


Projective planes are special cases of more complicated structures 
called balanced incomplete block designs (BIBD). The technique used 
in Section IV generalizes directly to produce new codes based on 
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BIBD’s. The new codes do not achieve p; = K-}, but they provide 
good solutions for some new values of K not of the form p?”. 

A (0, v, r, &, 4) BIBD is another system of points and sets of points. 
The sets are now called blocks instead of lines. There are v points in 
total and each block contains exactly k points. Each point belongs to 
r blocks and each pair of points is a subset of \ blocks. These condi- 
tions determine the number b of blocks. For bk = vr and r(k — 1) 
= A(v — 1) must hold in a BIBD (Ref. 6, p. 96; Ref. 7, p. 100). 


Examples: 


(1) The projective plane formed from GF(q) (see Section IV): 
b=v=Pt+qtlr=ak=aqtlaA=l. 

(2) The affine plane formed from GF (q) (Ref. 7, p.176):6 = @ + q, 
yv=@r=qtlk=qrA=1. 

(8) Many other examples are known: see, for example, Refs. 6, 7, 
13, 14, and recent volumes of the journals Sankhya, Annals of Mathe- 
matical Statistics, and the Journal of Combinatorial Theory. 


Given any BIBD with \ = 1, we may form a code as follows. Pro- 
ceeding as in Section IV, we select a particular block S to serve as the 
“equator.” Points on S will represent messages x. Points not on S 
will represent keys m. Blocks other than the equator represent 
encrypted messages y (bundles). Each x and m determines a unique 
block different from S which we use as the name of the y in (1). 

There are N =k messages, K =v—k keys, 6 — 1 encrypted 
messages, and k — 1 keys per bundle. Since \ = 1, the k — 1 keys in 
the bundle from x to y belong to distinct bundles leaving x’. Then 
po = 1/(k — 1) = 1/(N — 1). 

When the BIBD is a projective plane these formulas become again 
K=¢, N=1+K}, and p, = K-. For affine planes K = @ — q, 
N =q<1+4 Kt, and p, = 1/(q — 1) > K+. Thus, for given K, the 
affine plane has both smaller N and larger p, than one would expect 
from the projective plane. The larger p; should be expected since (72), 
(22) fail. 

To have (72), (i277) hold, r and & should be as close as possible. In 
most known BIBD’s other than the projective and affine planes, r 
and k& are considerably different. For example, consider the BIBD 
with parameters b = 195, v = 91, r = 15, k = 7, \ = 1 (number 111 
in Hall’s list’). The code obtained from this design has K = 84 keys, 
N = 7 messages, and p, = }. For comparison, the projective plane 
code based on GF (9) is superior on all counts, having K = 81, N = 10, 
and p, = 3. 
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Vil. RANDOM CODES 


The projective plane code in Section IV obtains p; = K-}, the 
smallest possible value, but it has only N = 1 + K?} messages. Codes 
with N > K have more interest. To see how large the corresponding 
p, might be, this section examines a code constructed at random. Now 
N can be made as large as desired. The main result will be that p; 
still need not exceed K~} by a large factor. 

The random code will have one free parameter A. Each x is allowed 
A possible encoded forms y. For each of the K keys the y in (1) is 
chosen at random from the A possibilities, all equally likely. The K 
choices are made independently. It may well happen that one of the 
A possibilities is never chosen in the K trials. In that case the code 
diagram, Fig. 2, will show fewer than A bundles from x. The code has a 
p, which depends on the random choices. We will look for the expected 
value E(p;). Specific codes, with the given N and K and having p,; 
less than this expectation, surely exist. 

All the data about ®(-,-) that B needs when substituting x’ for x 
are contained in a table showing how the encrypted messages y, 
depend on the key m. Figure 4 shows a convenient table as an A X A 
array of cells, each cell containing a list of all keys which determine a 
(y, y’) pair. Figure 4 corresponds to the pair of messages labeled z, 2’ 
in Fig. 2. Let v(y, y’) be the number of keys in the (y, y’) cell. 

Knowing y, B examines the corresponding column in Fig. 4. Since 
the K keys are equally likely, 


Pty’ |2, y, x’) = vty, y’)/ » v(y, Y1)- (18) 


The optimal strategy, by which B achieves (7), is to pick y, to maxi- 
mize v(y, y’). In Fig. 4 the row y, intersects the y column in a cell 
with the largest number of keys. There may be k > 1 such cells in 
the y column, in which case B may as well pick one of the k rows 
equally likely, at random. 

E(p,) can now be described as the solution to a distribution problem. 
Imagine that the correct key is key #1 and that it occupies the cell in 
column 1 and row 1. Distribute the K — 1 remaining keys at random 





Fig. 4—Table of keys. 


CODES WHICH DETECT DECEPTION 417 


over the A? cells. Let pn, be the probability that the (1, 1) cell con- 
tains v(1, 1) = n keys, that k — 1 other cells in column 1 contains n 
keys, and that moreover all of the A — k remaining cells in column 1 
contain fewer than n keys. Then 


E(p,) = a (19) 


is the probability that B picks the first row for y,. 

The exact formula for p,,, 1s cumbersome. It is not hard to simulate 
the distribution experiment on a computer in order to estimate E (p,) 
when K is less than a few hundred. This has been done, but only as a 
check on the simpler approximate calculation which follows. 

When A is large, each key has a small probability A~* of belonging 
to the cell (y, y’). After a large number K — 1 of independent trials, 
the number v(y, y’) of keys in the cell will have approximately a Poisson 
distribution with mean 


Pete HA) /ae. (20) 


Accordingly, we treat numbers v(y, y’) as independent Poisson random 
variables with mean \. The number (1, 1) is special because we started 
the distribution by placing key #1 in cell (1, 1); v(1, 1) — 1 is the 
Poisson variable for this cell. Poisson approximation has the dis- 
advantage that the total number of keys >(,,, »(y, y’) is itself a 
random variable. However, the mean number of keys is K and there 
is high probability that there will be close to K keys if K is large. 
The effect of this approximation should be worse for small K than for 
large K. The Poisson approximation and the simulation do give the 
same EH (p;) to within a few percent even for K = 25. 








Table |—E(po) for random designs 
A= is 1 1 4 16 K-3 
K = 25 0.47 0.44 0.54 0.2 
64 0.46 0.34 0.32 0.38 0.57 0.125 
100 0.40 0.29 0.27 0.32 0.46 0.1 
256 0.27 0.21 0.19 0.22 0.32 0.06 
400 0.23 0.17 0.16 0.18 0.26 0.05 
1,024 0.15 0.12 0.11 0.12 0.03 
4,096 0.087 0.069 0.062 0.068 0.092 0.015 
10,000 0.062 0.047 0.042 0.046 0.061 0.01 
40,000 0.036 0.026 0.023 0.024 0.032 0.005 
100,000 0.025 0.018 0.015 0.016 0.021 0.003 
1,045,576 0.0084 | 0.0063 | 0.0054 | 0.0055 | 0.0069 0.001 
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To simplify writing an expression for pn,x, let b, and B, denote the 
probabilities that a Poisson random variable has value exactly n or 
at most n. 


bn = Ate-*/n! 
Ba = bo + bi +--*+ Da. 
Then 
Dn k = D»-abn Bait (#27). (21) 


In (21), b,-1 is the probability that cell (1, 1) contains n keys, 
bk-! BA-¥ is the probability that a particular set of k — 1 other cells 
have n keys but all A — k others have n — 1 keys or less, and the 
binomial coefficient counts the different sets of k — 1 cells. Now 
insert (21) into (19) and sum on k to get 


E(p) = = (n/\A){ BA — Bay}. (22) 


Table I gives values of E(p,), computed from (22). For fixed K, 
a broad minimum of E(p;) occurs near \ = 1. Then (20) shows that the 
minimum occurs when A = K?, approximately. Thus, even when G 
designs his code by random means, he should pick A to make (77) and 
(iit) of Section IV hold as nearly as possible. 

Although (22) is only an approximate solution to the problem, it is 
also a generating function for the exact solution. Let e(K) denote the 
exact expected value of p; when the number of keys is K. Instead of 
e(K), eq. (22) provides 


(AA?) A> 


TK 1D!I (K — 1)! 1)! 1 = eXD (—)A*)e(K), 


1e., a sum of terms e(K) weighted by the probability that the Poisson 
experiment produces K — 1 keys in addition to key #1. In principle, 
one could multiply the sum in (22) by exp (AA?), expand the result 
into a series in powers of A, and identify the coefficient of \*— as 
A?(KDe(K)/(K — 1)!. The result for e(K) is unpleasant and (22) is 
accurate enough. In an experiment to estimate e(64), 2000 trials were 
made for each of \ = 4, 1, 4. The fractions of trials in which B suc- 
ceeded were 0.31, 0.30, 0.37. 


Vill. SYSTEMATIC CODES 


This section constructs a systematic code with large N by means of 
another generalization of the projective plane code of Section IV. 
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Fig. 5—Code designed from projective space of dimension M. 


Unlike the random code, which had N as a free parameter, this code 
will specify a particular N. That disadvantage is offset by a smaller 
value of E(p;) and by a more important advantage discussed in 
Section IX. 

Figure 5 will illustrate the code design. Given a field GF(q), one 
can construct a projective space PG(M, q) of dimension M in which 
points are again equivalence classes of nonzero vectors, now having 
M + 1 components. M = 3 in Fig. 5. The number of points is 


f(M) = ( -—D/q@-W=1tate-ta% (23) 


Each set of points satisfying a system of M — D independent linear 
homogeneous equations is a D-dimensional subspace PG(D, q) con- 
taining f(D) of the points of PG(M, q). The number of D-dimensional 
subspaces of PG(M, q) is® 


f(M)f(M — 1)---f(M — D) 

f(D) f(D — 1)--- FO) 
Z (qui ae 1) (q¥ = 1)- be (quti—D = Ly 
(PS gg) 


Proceeding as in Sections IV and VI, we again select a particular 
subspace S of dimension M — 1 to serve as the ‘“‘equator.”’ In Fig. 5, 
S is a projective plane. We again identify messages x with subspaces 
of S. But now S has subspaces of dimension 0, 1, ---, M — 2 and so 
we can specify the dimension s of the messages as another parameter of 
the design. In Fig. 5, s = 0; another code might use s = 1. Given 
M, s, the number of distinct messages is 


N = g(s, M — 1). (25) 


g(D, M) = 


(24) 
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Again, the points not in S will be keys. There are 


LS f(M) = Fd) Gg (26) 
keys. 

The key m (a point) and message x (of dimension s) determine a 
unique (s + 1)-dimensional space which will represent y. Since y 
has f(s + 1) points and f(s) of them belong to S, y contains f(s + 1) 
— f(s) = qs" keys. Now (22), (az) of Section IV need not hold. 
Instead, for each x, the q™¥ keys fall into 


A = q¥-=1 (27) 
bundles of 
K/A = qe 


keys each. In Fig. 5, A = q@?, K/A = q. 

To find p, consider the matrix, Fig. 4, corresponding to a particular 
pair x, x’. The q**! keys in a given column y need not be distributed one 
to a row [as in (2) of Section IV]. Each cell in the matrix contains 
all the keys belonging to an intersection between (s + 1)-dimensional 
spaces through x and 2’. If x and 2’ themselves intersect in an r-dimen- 
sional space x () x’ then the cell contains the g**! keys of an (r + 1)- 
dimensional space through xf) 2’. B must choose one of g*t!/q7t! 
= q*" equally likely rows; his probability of correctly guessing y’ is 


Po(@, y, x’) = qr *. (28) 
Now (6) and (28) provide 
Po = L Arq, (29) 


where h(r) is the probability that a randomly chosen x’ intersects a 
specific x in a space of dimension r. In (29), the range of summation is 
23+1—Msrss-—1 provided 2s+1 2 M. Butif 2s+1< M, 
as in Fig. 5, then x () x’ can be empty. In that case the summation 
(29) extends over -1 Sr Ss—l. 

We now show 


AG) Sg O9 R= = 1 Mise 2g 8) / 

which together with (24) and (29) gives p;. The factor g(r, s) in (30) 
is the number of different r-dimensional subspaces of x; it suffices to 
show that the remaining terms of (380) give the probability that a 
randomly chosen x’ intersects x in a particular subspace H of dimen- 
sion r. Given x, and a subspace H, we can find M basis vectors éo, é1, 
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-++, @y for S such that eo, e1, ---, e- span H, and é, é1, ---, és span x. 
Each x’ contains H and so has a basis containing é, ---, e,. The re- 
maining s — r basis vectors of x’ can have the form 


M 


y= »: £543, t= rp, +s, 


j=rtl 


in which é, ¢1, ---, €¢- do not appear. In determining £;,; one must not 
allow x’ to intersect x in a space of dimension larger than r. This 
requirement is equivalent to a condition that the partial sums 


M 


v= Dd £4.58; tz=r+1,---,8, 


j=stl 


of v; be linearly independent. Then the v7 span an (s — r — 1)-di- 
mensional subspace x° of the (M — s — 2)-dimensional subspace S° 


spanned by és41, -+:, @ar- The factor g(r —s—1, M — 8 — 2) in 
(80) is the number of ways of choosing x’. Having chosen H and x° 
(and hence £,; for 7 = s+ 1, ---, M), the (s — r)? numbers 

Ei; tr PL, s+, 85 \ ok oes oe ae 


can be chosen in g‘*-”” ways to specify x’ completely. Now the numer- 
ator in (80) is the number of ways of picking an v’ to have an r-dimen- 
sional intersection with x and the denominator is the number N — 1 
of messages (different from x) from which B chooses 2’. 

Now q, M, and s determine N, K, A, p;. Table II gives some of the 
better designs obtained by taking g = 2. These all have M = 2s + 2, 
so that K/A? = 1 follows from (26) and (27). For given K, the least 


Table Il— Designs with q=2 


Dimensions Keys Inputs Prob (B wins) 
M Ss K N Po 

2 0 4 3 0.6666 

4 1 16 35 0.400 

6 2 64 1,395 0.2222 

8 3 256 200,787 0.1176 
10 4 1,024 1.09 X 108 0.0606 
12 5 4,096 2.3 X 104 0.0308 
14 6 16,384 2X 1045 0.0155 
16 7 65,536 6 xX 109 0.0078 
18 8 262,144 8 X 1074 0.0039 
20 9 1,048,576 4 X 10° 0.00195 
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Table lll— Design with q= 2, M=12, s=5 


r= dim (x 1) 2’) h(r) Do(2, Y; x’) 
—1 0.3979 0.015625 
0 0.5773 0.03125 
1 0.1204 0.0625 
2 0.00432 0.125 
3 2.9 X 107° 0.25 
4 3.4 X 10-8 0.5 


p, was always obtained when K/A? = 1; a similar phenomenon was 
encountered with random designs having \ = 1 [ef. eqs. (20) ]. The 
table contains codes having N much larger than K. At the same time, 
p, is approximately 2/K?, which compares well with the projective 
plane code. 


IX. CHOICE OF x’ 


Until now B had no control over the choice of x’. We treated x’ 
as a random variable which B accepts as given. But suppose that B 
has no particular z’ in mind; he merely wants to mislead G by sub- 
stituting any convenient wrong message zx’. An optimal strategy for B 
must again achieve (7) but B will select x’ to maximize p,(a, y, x’) for 
each given x, y. 

A code with small p;, for randomly chosen x’, may now be a poor one. 
Table III shows more detail about the code with q = 2, M = 12, 
s = 5 in Table II. This code had p; = 0.0308, as computed from (29). 
But some false messages x’ intersect x in spaces of dimension r = 4; 
if B substitutes one of these, his chance of success is 0.5 [eq. (28) ]. 


Table IV — Effect of changing field, keeping key size 
approximately fixed 


: : . Prob (B wins) 
Field | Dimensions} Key bits | ¢ /A2 Msg bits 





q M ss loge K log:N |. 
ifr =s-—l1]| averaged 

256 2 0 16 1 8.01 0.0039 0.0039 
41 3 1 16.08 41 10.7 0.0244 0.0250 
16 4 1 16 1 16.1 0.0625 0.0078 
9 5 2 15.9 9 19.2 0.1111 0.0137 
7 6 2 16.86 i! 25.5 0.1429 0.0058 
5 7 3 16.24 5 28.3 0.2000 0.0096 
4 8 3 16 1 32.5 0.2500 0.0078 
3 10 4 15.9 1 40.5 0.3333 0.0082 
2 16 7 16 1 65.9 0.5000 0.0078 
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The code is good for randomly chosen x’ only because B usually has a 
message x’ with r = — 1 or 0. 

A good code for G must now have p,(a, y, x’) small uniformly, not 
just on the average. The code of Section VIII achieves this if q¢ is 
large. For (28) shows p.(z, y, x’) S$ 1/q. Unfortunately for G, in- 
creasing qg has the effect of decreasing N. Then G must compromise, 
picking q small enough to obtain large N but large enough so that 
B’s chance of success, 1/q, is tolerably small. Table IV shows a typical 
tradeoff between N and 1/q. The designs in Table IV all have approxi- 
mately the same key size K = 216 Table IV shows both probabilities 
of success for B, 1/q if B makes r = s — 1 and the averaged value 
(29) if B picks x’ at random. If one ignores the designs with K/A? # 1, 
the averaged probability doesn’t change much. To reduce 1/q from 
0.5 to 0.1 reduces the message size, log N, by a factor of 3. 
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Queues arising in buffers due to either random interruptions of the 
channel or variable source rates are analyzed in the framework of a single 
digital system. Two motivating applications are: (1) multiplexing of data 
with speech on telephone channels and (12) buffering of data generated by 
the coding of moving images in Picturephone® service. 

In the model a source feeds data to a buffer at a unsform rate. The 
buffer’s access to a channel with fixed maximum rate of transmission is 
controlled by a switch; only when the switch is closed (‘‘on’’) ts the buffer 
able to discharge. The on-off sequence of the switch is indicated by a 
burst process which is a key element in this paper. In such a process, 
long periods during which the switch stays closed alternate with periods, 
called bursts, during which the on-off sequence is a first-order Markov 
process. The length of a burst 1s randomly distributed. This is a general- 
ization of the memoryless burst process considered in an earlier paper.} 
In that paper we gave formulas for the efficient computation of various 
functionals of the queues arising in the system. Now we extend these 
formulas to hold for the generalized class of burst processes. 


I. INTRODUCTION 


In a recent paper! we considered the problem of buffering the output 
of a uniform source whose access to a given transmission channel is 
controlled by a burst process. We gave formulas for efficiently com- 
puting various functionals of queues that form in such a communica- 
tion system when the controlling burst process is memoryless. 

In the present paper we generalize the controlling process to one 
which is first-order Markov within a burst. This generalization con- 
siderably increases the usefulness of the formulas. Consider, for ex- 
ample, the two motivating applications discussed in Ref. 1: (2) multi- 


“The sequence of names was decided by coin tossing. 
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plexing of data with speech on telephone channels?“ and (72) buffering 
of data generated by the coding of moving images in Picturephone® 
service.’ For the first application, analysis of data shows? that it is 
necessary to go to a first-order Markov process to adequately model 
the burst phenomena in speech signals. In the Picturephone applica- 
tion, although the correlation of data rates within a frame is negligible, 
it is quite significant from frame to frame.* For frame-to-frame coding, 
therefore, the present model with memory becomes necessary. 

The system under consideration is shown in Fig. 1. The source emits 
data uniformly at the rate of 1 symbol per unit time. The transmission 
rate of the channel is (k + 1) symbols per unit time, where k& is some 
positive integer. The on-off pattern of the switch is indicated by a 
binary burst process: H(j) is either 0 or 1 for j = 0, 1, 2, ---. If 
E(j) = 0 the switch is closed for the time duration [j, 7 + 1); other- 
wise, the switch is open. We assume that there are long periods during 
which #(7) = 0 and that at the end of every such period the buffer is 
empty. The activity separated by such periods we call a burst. We 
assume bursts to be independent of each other, and the burst length 
to have a probability distribution which is either geometric or is a 
weighted sum of geometric distributions. Within a burst, {#(j)} is 
assumed to be a homogeneous two-state M arkov chain with transition 
probabilities 6, and @ given by 


6, & Prob. {E(j + 1) = 1|E(j) = 0} (1a) 
6. & Prob. {E(j +1) = O/E(j) = 1}, 7 =0,1,2,---. (1b) 


These two parameters completely specify the Markov chain; the 
probabilities of the other two possible transitions are, of course, given 


by 


1 — 6, = Prob. {F#(j + 1) = 0|E(j) = 0} 
and 
1 — 6 = Prob. {E(j + 1) = 1|E(y) = 1}. 


We shall assume that 0 < 6; < 1 and 0 < 6 < 1. If 6, + 6. = 1, E; 
becomes a Bernoulli sequence of independent random variables, which 
is the case treated in Ref. 1. 

In subsequent sections of this paper we will obtain the results sum- 
marized below. 

In (2), (77), and (777), we assume the switch to be controlled by an 
infinitely long sequence generated by the Markov chain described by 
(1) ; these three results are therefore of interest in situations where the 
distribution of burst lengths is not known accurately. 
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UNIFORM 
SOURCE BUFFER SWITCH CHANNEL 


Fig. 1—Switched communication system. 


(7) We derive a recursive formula for the steady-state distribution 
of buffer content for finite buffers, the recursion being with respect to 
the buffer size, NV. 

(zt) Let T°? be the steady-state probability of a buffer of size N 
being full when the channel is inaccessible. (7, therefore, is the 
steady-state probability of a transmission fault.) We show that 


1 = oe 1 1—6,—6 1 1—6, 1 
TN+EH) ~ 7 — 6, TON+E) l= TOM t= 3,7? 








where (k + 1), as previously defined, is the transmission rate of the 
channel. We show that the steady-state probability of the buffer 
being full is 7“/(1 — 62), and therefore satisfies the same recursive 
relation. 

(iit) For a buffer of size greater than N, let PF“ denote the mean 
time to first passage through the level N. We show that Ff“ satisfies 
the recursion 


1 — 601 — 0 
(N+k+1) — — (N-+k) ie (N+1) 
. Ch ie ia 


1s — + + 62 
oo EO Bye Ase resey 

The next two results are of interest when the distribution of burst 
lengths is well-approximated by a weighted sum of geometric 
distributions. 

(tv) Let G™ be the probability of overflow for a buffer of size N 
during a burst. Then if the burst lengths have a geometrical prob- 
ability distribution with parameter p{i.e., Prob. (burst length = 7) 


= p*1(1 — p)}, we show that 


pe ate eee et eh A ed 
(NFEH) == (NFR) = WH)” T — 9, @™M 
G ~ pd — 6) G I-® G 2 


This result generalizes to the case when the burst length distribution 
is a sum of geometric distributions. 

(v) We derive a closed expression as well as a recursive formula for 
the mean time for first passage through a level N during a burst 
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conditioned on the occurrence of an overflow. The recursion is with 
respect to N, and the bursts are assumed to be distributed as in (7v). 

(vt) We determine the asymptotic behavior of all the formulas 
in (2) to (v) as N— o. For instance, we prove that, as N > o, 
(1/G() ~ s%, where s is the unique positive real root of a particular 
polynomial, such that s > 1/p > 1. 


The closed expressions are all valid for k 2 1 and N = O, and the 
recursions as stated above are valid for N = 0. The recursive formulas 
provide very efficient means for computation of the various functionals, 
particularly in design studies where a whole range of buffer sizes is to 
be investigated. 


1.1 Notation 


Whenever necessary we will use a superscript in parentheses, e.g., 
x“), to indicate that the quantity corresponds to a buffer of size M@ 
(or to the level M in a buffer of size greater than M). If x is a vector, 
then the superscript (/) will also indicate that the vector x“ is 
(M + 1)-dimensional with components z/”, 7 = 0, 1, 2, ---, M. These 
two uses of the superscript are consistent betaiise the dimensions of 
all vectors defined in this paper are related to buffer size (level) in this 
manner. Whenever the superscript is missing, the standard value (NV) 
will be implied. 

We will use lower-case boldface letters to denote column vectors, 


upper-case boldface letters to denote matrixes, and a superscript T' to 


denote the transpose. We will dencte by I the identity matrix, by 1 


the vector whose components are all equal to 1, and by e; the vector 
whose jth component is 1 and the rest 0, e.g., eg = (1, 0, ---, 0). 


1 


Il. EQUATIONS OF THE PROCESSES 


Let B(t) be the number of symbols in the buffer at time ¢. Then for 
a buffer of size N 


Bit + 1) 


Max [B(t)—k,0] if H(t) =0 (2a) 
=Min[BO+1,N] if FE =1. (2b) 
In the last equation the assumption is that if the channel is inaccessible 


and the buffer is full, then the current source symbol is discarded and 
the buffer remains full. 


In order to study the evolution of the buffer content process, it is 
convenient to introduce two (N + 1)-dimensional vectors p(t) 
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= {p.(t), ---, pw(t)} and q(t) = {q.(é), ---, gv(t)} defined by the 
equations 


pi(t) = Pr {B(t) = 7, E(t) = 0}, 1=0,:::,N (3a) 
git) £ Pr {B(t) =7,E() = 1}, i=0,---,N. (8b) 


Under the assumption that {£(é)} is the two-state Markov chain de- 
fined by (1), it is straightforward to show that p(t) and q(¢) represent 
a 2(N + 1)-state homogeneous Markov chain. For 


A 


po(t + 1) 


Pr {B+ 1) = 0, E(¢ + 1) = 0} 


¥ Pr (Bi) = i, EW +1) = 0, E® = 0} 


= ¥ Pr {EU +1) = 0/E = 0, BO = 4) 
x Pr {B(t) =i, E() = 0} 


= (1-4) ¥ pild, (4) 


where the last step follows from the Markov property of {H(t)}. 
Similarly, 


pill ++ 1) _ (1 — 91) pizx(t) alr O2q:-1(t), t= 1, 2, “P35 oe k, 
= O0q:-1(b), t~=N—k+1,---,N —-1, 
= O2:{q_-i(t) + gn (O)}, t=QN. (5) 


Also 
k 

qaét@+til)=A d pill), 7 = 0, 
i= 


= 61pi+%-(t) as (1 co 92)qi-1 (2), 1= Ms 2, sy ey N= k, 
=> (1 — O2)qi-1(0), 17=N—k-+1, +N — it. 
=(1-f){aa® +a}, tN. (6) 


Equations (4), (5), and (6) can be written conveniently in matrix 
notation as 


p(t¢+1) = (1 — 6,)Bp(t) + 62:Aq(t) (7a) 
q(t + 1) = 6:Bp(t) + (1 — 62)Aq(t). (7b) 
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Here the (NV + 1) X (N + 1) matrixes B and A are defined as 


(ke +1) 

7 

ie dase 070 0 0 

1 i 1 0 
A Bog : Lire 
B= 0 Lie: ae 9) [7 8) 

011 
N 


Notice that the composite matrix 


(1 = 6,)B oA : (9) 
6,B (1 — 62)A 
is stochastic (nonnegative elements and evcry column sums to 1) and 
independent of t. Equations (7a), (7b) are, therefore, the transition 
equations of a 2(N + 1)-state homogeneous Markov chain. 


2.1 Equations for some new probabilities 


For many of the derivations in the succeeding sections (e.g., mean 
first passage time, probability of no overflow, etc.) it is convenient to 
define certain new probabilities r;(¢) and s;(¢), 7 = 0, 1, ---, N. Con- 
sider a buffer of size greater than N and let X(t) be the event 
(\s=0 {B(s) S N}, ie., the event that B(s) does not exceed N at any 
of the time instants s = 0, 1, 2, ---, ¢. Then 


r;(t) 


s(t) £ Pr {B(t) =i, E(t) =1,X@f, i=0,---,N. (10b) 


We define the (N + 1)-dimensional vectors r(t) and s(¢) with com- 
ponents {r.(t), ---, rv(t)} and. {s.(t), ---, sw(t)}, respectively. 

In a manner analogous to the derivation of eqs. (7a) and (7b), 
we can derive recurrence relations giving r(t + 1), s(¢ + 1) in terms 
of r(t), s(t). Thus, forz = 0,1, ---, N, 


r(t +1) & Pr {Be +1) =i, Et +1) =0,X¢+D)} 
= Pr{B(tt+ 1) =7, E(t +1) = 0, X(t} 
= (1 — 0) Pr {B(t+ 1) =i, E() = 0, X(} 
+ 6. Pr {B(t +1) =7, E(t) = 1, X()}, (AD 


where the last equation follows from the Markov property of {H(t)}. 
As before, B(t + 1) and E(t) determine the possible values of B(t) 
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and we get 
k 

r(é+ 1) = (1 — fr) » r;(Z), z= 0, 
= 


ees 61) ris (t) T O28;_-1(t), += I, By eae Ne k, 
Oo8;-1(b), t= N —k + 1, eee S N. (12) 


Il 


Comparison of eq. (12) with eqs. (4) and (5) shows that for 
eee $644) 290 SOB + FASO, (13) 
where A is obtained from A by setting to 0 the single nonzero entry 
on its main diagonal, i.e., 
A =A — ened. (14) 
Analogously to (13) we can also show that 
s(t +1) = @:Br(t) + (1 — 42)As(Z). (15) 


The transition equations (13) and (15), although very similar to 
eqs. (7a) and (7b), differ fundamentally from them in that A, and 
consequently the matrix 


(1 — 61)B 6A 
| oB  (1—6)A)’ 16) 
are not stochastic. 

We close this section by deriving from (13) and (15) a useful second- 
order recursion involving s(é+ 2), s(¢-+ 1), and s(t). Multiplying 
(13) by 61, (15) by (#1 — 1), and adding we get 

Or(t{+1) = (1 —4@)s¢4+ 1) — (1 — 4 — 62)AS(E). (17) 
From (15), 
s(¢-+ 2) = 6Br(t+ 1) + (1 — 6,)As(¢ + 1). (18) 
Premultiplying (17) by B and adding to (18) gives 
s(t + 2) = [C1 — 0)B + (1 — 6:)A]s(@ + 1) — (1 — 41 — 62)BAs(}), 
t=0,1,2,---. (19) 
As we will have to refer frequently to the recursion (19) it is convenient 
to define 
c 2[( - 6)B + (1 — 4)A] 
and (20) 
D = — (1 — 4; — 62)BA 


I> 
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so that eq. (19) becomes 
s(t + 2) = Cs(é+ 1) + Ds(d), t¢=0,1,2,---. (21) 


lll. INFINITELY LONG SEQUENCES 


When the burst length distribution is not known, useful information 
can still be obtained by considering the behavior of the buffer content 
when the switch in Fig. 1 is controlled by infinitely long sequences 
generated by the Markov chain (1). In this section we derive various 
functionals for such a situation. 


3.1 Stationary distributions for finite buffers 
In eqs. (7a), (7b), if we set pt+1) = p(t) =p and q(t +1) 
= q(t) = q, then the vectors p = {po, ---, pw} andq = {po, ---, gv} 
give the limiting distributions? as t > © of the buffer content process 
defined in Section II. The limiting distributions p, q are thus the solu- 
tions of 
p = (1 — 6:)Bp + 62Aq (22a) 


q = 6,Bp + (1 — 62)Aq (22b) 
with, of course, the normalization 
i7(p + q) = 1. (23) 


In this section we derive a simple formula for computing the vectors 
p and q for a given buffer size (NV + 1) in terms of p and q for a buffer 
of size N. As a first step we simplify the problem by eliminating p from 
eqs. (22a), (22b). Multiplylng (22a) by 6; and (22b) by (6: — 1) and 


adding gives 
p= =o )q - hg (24) 
6, 6, 





Substituting (24) into (22b) gives 
(I — (1-—6)B — (1 —@.)A + (1 — 6, — 6.)BA]q = 0. (25) 
Premultiplying (24) by 17 and subtracting from (23) gives 


61 
17q = 26 
61 + 42 79) 
since 17A = 17. It is important to note that the N + 1 component 
equations in (25) are not independent. Indeed, since 17A = 17B = 17, 
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it is clear that the first equation is just the sum of the rest and may 
therefore be ignored. The remaining N equations are linearly inde- 
pendent and we can solve them for qo, ---, gy—1 in terms of qy, and 
then obtain gy from (26). Finally, we can obtain p from (24). 

In carrying out the solution of (25) and (26) in this manner the 
recursion we are looking for becomes obvious if we define the (N + 1)- 
dimensional vector y‘”? with components given by* 


OP a = qe i/ Qh”, t= 0, mi N. (27) 


[ The meaning of the superscript (JN) is given in Section 1.1. ] Equations 
(25) and (27) give 





yo? = 1 (28a) 
6, 
(NY) = 
Yi [6 (28b) 
wy YA ee) ee 28 
Yi 1 6,” a ’ ’ ( c) 


1-—0:— w os 








(N) 
ys = e ++ 


6 ite ee 





t= 
at 1—6,-—6 1 — 
(Ny) — _Yi-1 1 2,(N) _ Ny, . 
Yi i-a7 Too 2 TS au 4, t>k+1. (28e) 


The important fact about (28) is that the superscript (NV) is superfluous. 
If N is changed to N + 1, for instance, in (28) we see that 


yn = ye, i= 0, rey N, (29) 
and the last component of y‘"* is 


1 6, — 0 L= 20 
YEE = Tay + Sa a a 8, yarn (30) 








Thus the vector y‘"* is obtained from y‘”) by merely appending to the 
components of y“) one component given by (30). To complete the 
recursion for q(t), we note from (26) and (27) that 


1 _~f{ath Mat (N-+1) 
ghi> ( 91 ) X i oD 








*Note that qi” # 0, for otherwise the solution g of (25) is the null vector which 
cannot satisfy (26). 
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and therefore, from (29) and (80), 








1 1 61 + 62 
ae ~ gy Ta, URE 
_ 1, ate 
qs . 6i:(1 — 62) 


x Ty + (1 =O 62) y P21 —k (1 = O)y (32) 


Equation (32) gives gf» in terms of the components of q‘?. 
+ 


3.2 Probability of transmission fault and of buffer being full 


Frequently it is adequate to determine the variation with buffer 
size of the components p” and gq rather than of the complete dis- 
tributions p©&) and q. Notice that the probability of transmission 
fault 7’ is, by the definition given in Section I, identical to aN”; 
and the Drobabiliey that a buffer of size N is full is slearly pM + q™ 
It is therefore of interest to obtain recursions for these quantities 
without having to compute the entire p and q vectors from the re- 
cursions derived in Section 3.1. 











By premultiplying eqs. (22a) and (22b) by eZ(£{0, 0, ---, 0, 1}) 
we get 
ep = 6e%Aq = "2 efq (33) 
1 — 6, 
or 
mee TN) fag as 
ex (p as q) = — 0 — —_ 6s? (ot) 
1.€., 
M 4+ 9g = TON), 
‘i 1 — 65 


It therefore suffices to obtain a recursion for 7. Suppressing the 
superscript (N) from (28e), and summing over the index 7 from k + 2 
toN +k + 1, we get 





N+k+1 1 N+k 1 — 6, — 6, N41 
i=k+2 ey = Be me wer 05: m8 ue 
Be ese: 


Since 7) = qf, (31) is used to relate T to {y,;}. Now substituting 
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the values of {y:} given in (28) we obtain 


1 1 1 Li Oi 0s, “1 i Gage ee 
TNH) ~~ [7 — 69,7) ~ 1 — 6, TOAD Fr 1 — 6,7) 


N21. (86) 


= 0, 


Equation (36) is the recursion quoted in Section I. 


3.3 Mean first passage time 


Let N be a positive integer and let the buffer be of size greater than 
N. Let an infinitely long burst start at ¢ = 0, with the buffer initially 
empty, and let F“ denote the mean time required for the buffer 
content to first exceed N. The manner in which F'“”) depends on N 
is a useful guide in designing an adequate buffer, especially when the 
distribution of burst lengths is not accurately known. In this section 
we derive a recursive formula for F'“”, the recursion being with respect 
to the level N. 

By definition, the Nth component of the vector s(¢) defined in eq. 
(10b) is the probability that the level NV is exceeded for the first time 
at the instant ¢ + 1. Therefore, 


FO) 3 (t + sy (2) 


ef » (t + 1)s(t). (37) 


In the appendix we show that if \ is an eigenvalue of the matrix de- 
fined in (16), then |\| < 1. This proves the convergence of the series 
in (37). 

We proceed by obtaining an expression for >0 25 (¢ + 1)s(¢) by the 
method of generating functions. Let 


S(e) 2 ¥ zs) (38) 
so that 7 
S'() = F + Lets) (39) 
and, in particular, ; 
S(1) = 5 @ + Isto. (40) 


FORMULAS ON QUEUES—II 435 


From the equation, (21), governing the evolution of {s(¢)} we find that 
S(z) = LJ -— eC — 2D} "{2s(0) + 2s(1) — 2Cs(0)}. (41) 
It is shown in the appendix that the above matrix inverse exists for 
all |z| S$ 1. Following the procedure already outlined [eqs. (39) and 
(40) | we find that 
¥ (t+1)s() =[I-C-D}[C+2D ]TI-C—D]}{s (0) +s (1) —Cs(0)} 
t=0 
+[I—C—D]"{s(0)+2s(1)—2Cs(0)}. (42) 


The resulting expression for /“, from (87) and (42), is further 
simplified by using the following identities: 


ef = 17(I-C—D] 
; 1 


and 
17[C + 2D] = (6 + &)17 + (1 — 6. — 2,)e%. 
Then 
poy = BE yar — C —D}{s(0) — (1 — 6)Bs(0) + @,Br(0)} 


+ (17r(0) — 62)/61. (48) 


The above expression for Ff“? holds for arbitrary initial states of the 
buffer. However, as mentioned in the beginning of this section, in 
deriving a recursive formula for /“” we wiil assume the buffer empty 
at ¢ = 0. In that case, r(0) = rep and s(0) = (1 — r)eo with re[{0, 1]. 
Substituting in (43) we get, for this special case, 





POO = (0). 02)17 T=: C= D) (44) 
We can derive a recursion for the quantity 
gm 21711 —C — D)“eo (45) 


from which the recursion for Ff“? will follow immediately. The pro- 
cedure is very similar to the one used to derive (36). Thus let 
x? = (a, %1, -+*, Zw) be the solution of 


(I=C—D)x =e. (46) 


Then, since 17(I — C — D) = 6;e%, we get ty = 1/0,. We may re- 
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place the first of the component equations in (46) by this relation. 
Exactly as in (27) and (28), we find that the components z{ (¢ = 0, 
--+, N) of the vector x‘? are, in reverse order, the first N + 1 numbers 
#; in the sequence generated as follows: 





pes (47a) 
Ay 

PPO By Beis (47b) 

7 i= Ao t—l, rae Ra 7) 

Be tae as TM De Oe 302 2 Tee Bi i. , 

t= l= 6 a Ne es Lik f= 7 vik, t~>k. (47e) 


Summing (47c) over ¢ from k +1 to N+k+1 and noting that 
f™M = Yo x, we get 





1 1 — 0, — 62 1 — 6; 
(weet) — — + pevgny — 2-7 81 7 82 pg) — 4 7 71 py 
f t—o,/ 1 — 05 f =a) 
= k a ] kr-1 | 1—6;— 62, 
= 2% 1 — 62 fo * 1 — 6, ' 
1 
~~ 1 — 9,7 (48) 


where the last step follows from (47a), (47b). However, 
PE TEM = (a= O2)/81}/ i+ 62). 
Substituting in (48) we get 





1 1 — 6, — & 1-6, 
(N+k+1) — __ ss (N+) Se EE ACN Y) 
z i Ae paar, i — 6 
fe 01 ater (08) 
t= 6; 


Interestingly, + does not appear explicitly in the recursion (49); it 
does, of course, affect the initial conditions [1. e., the values of F®, 
-++, FO] via eq. (44). 

It is interesting to note that the forcing term (01 + 62)/(1 — 62) in 
(49) can be eliminated. By direct substitution it is seen that if 61 4 k6. 
then F™) — (6; + 62)N/(01 — 62) satisfies the homogeneous recursion 
(49). When @,;=k6s, the same is true of FPF“? — (0:+62)N?/k(2—01—62). 
These transformations which reduce (49) to the homogeneous form 
will be of use when we investigate the asymptotics of solutions in 
Section V. 
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IV. BURSTS WITH GEOMETRICALLY DISTRIBUTED LENGTHS 


When information is available concerning the distribution of burst 
lengths we can compute design parameters which are more realistic 
than the quantities 7”? and F“™ discussed in the preceding sections. 
Clearly an event is of consequence only if it occurs within a burst. 
Its probability of occurrence at the fth instant must therefore be 
weighted by the probability that the burst length exceeds ¢t. If the 
distribution of burst lengths is the weighted sum of geometric dis- 
tributions, i.e., 


J 
Prob. {Burst length = 7} = ¥° B.(1 — px)pi?, 
k=0 


t7=1,2,---; O<p.e <1, (80) 


then simple recursions can be obtained for such weighted averages. To 
keep the derivations simple we have only treated the case J = 1 
since, as shown in Ref. 1, generalization to higher values of J is straight- 
forward. In Sections 4.1 and 4.2 we derive such recursions for the 
probability of overflow within a burst and for the mean time to first 
cross a level within a burst. 


4.1 Overflow within a burst 


For a buffer of size greater than N let G“ denote the probability 
that the buffer content exceeds N (at least once) during a burst. It 
is clear that G“ also equals the probability that a transmission fault 
occurs (at least once) during a burst, when the buffer size is N. We call 
G) the probability of overflow. 

By its definition in (10), sy(¢) is the probability that the buffer 
content exceeds N for the first time at t + 1. Therefore, 


Gm & ¥ sy(t) Prob. {burst length = (¢ + 1)} 
t=0 


y SN (t)p! 


ef 2 p's(t). (51) 


As proved in the appendix, the matrix in (16) has all its eigenvalues 
strictly within the unit circle. Therefore the series in (51) converges 
forp £1. 
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Multiplying (21) by p‘*? and summing over ¢ from 0 to © we get, 
on re-arranging terms, 


(I = pC — p*D) X p's(t = s(0) + p{s(1) — Cs(0)} 
[I — p(1 — 6,)BJs(0) + p#:Br(0). (52) 


In the appendix we show that (I — pC — p?D) is nonsingular for all 
p <1. Therefore 


GO = ef(I — pC — p°?D)“[{I — p(1 — 6:)B}s(0) + p0:Br(0)]. (53) 


As before, specializing to the interesting case of an initially empty 
buffer, i.e., r(0) = reo, s(0) = (1 — 7)eo, with 7 in [0, 1], we get 


GO = [1 — 1). — p) + p&ijen(I — pC — p*D)~eo. (54) 


We can obtain a recursion for G“ by a procedure almost identical 
to that used in obtaining the recursion for 7“. Note that if z\™ is a 
vector such that 





(I — pC — p*D)z™ = eo (55) 
then the components of the vector z‘%?/z™ are, in reverse order, the 
first NV + 1 numbers in the sequence 8;, 7 = 0, 1, 2, ---, generated by 
the relations 
a= 1 (56a) 
jae ey Ya (56b) 

p(l = 62) ) ae) 
i cg 1 ze pe Or SO). eae 
yo p(1 — 02) = 62) Bia + See ee i= Si-k-l, 
a>k. (56c) 


The first component equation in (55) then gives 


iL k ‘ 
Zz = Pa WiZN-i, Node, (57) 
where mo, ---, 7, are the leading (k + 1) entries in the first row of 


(I — pC — p*D). (The remaining components of this row are null.) 
For N > 2k, each term on the right-hand side of (57) satisfies the 
recursion (56c). Therefore 1/z satisfies the same recursion. From 
(54), since G@™) is proportional to 2 we find that 1/G™? also satisfies 


FORMULAS ON QUEUES—I! 439 


the same recursion, 1.e., for N > 2k, 


1 41 1, ed &—%) 1 
Lo) <1 


—7=3, Gaeen (68) 


It can additionally be shown that the above recursion holds for 
2k = N > 1, by direct substitution of the initial values of G™. 


4.2 Mean time for first passage within a burst 


For a buffer of size greater than N, let ¢ denote the time required for 
the buffer content to first exceed N within a burst. Let H“? denote 
the expectation of t conditional to the hypothesis that the level N is 
indeed exceeded within the burst. (Equivalently, H“ is the mean 
time taken by a buffer of size N to first overflow within a burst, given 
that an overflow does occur.) Clearly 


0 


> (¢ + 1)sy(t)-[Prob. that burst length = ¢+ 1]/@4™ 


t=0 


HW) 


Il 


X (+ Vow (Doan 


ef Y(t + 1p's()/G. (59) 
t=0 
A comparison of (51) and (59) shows that 
d 1 

HW = 7 (0G) -aea5- (60) 
Multiplying (53) or (54) by p and differentiating with respect to p 
we can get closed expressions for H‘? for arbitrary initial state and for 
the buffer initially empty. The resulting expressions are rather un- 
wieldy. 


We can also use (60) to get a recursion for H‘”. Thus let 


yy) 4 ae (61) 
p 
Then 


Sepa (62) 
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where U\™) © (d/dp)V™. Here V“™ satisfies the recursion (58), 


and U™) satisfies a recursion obtained by differentiating the recursion 
for V“, Thus 


1 p(1 — 61 — 42) 1— 6 
(NY) = —_—__ Yn-1) PAS 1 72) 7 (Nk) (N—k-1) 
ye iS ae * 1 — @ ¥ ca 
and 
1 (1 — 01 — 92) by 
(N) = sd (N-1) Nee UA TONE BS ne ee 
: ce ha ie 1 — 4 . c= 
1 (1 — 6, — 42) 
See ee (N) See CNA) 
mL = ba) Vr? =? V . (638) 


V. ASYMPTOTIC BEHAVIOR 


In this section we discuss the behavior as N > © of sequences gen- 
erated by the recursion 


_ 1 _ wl — 61 — 42) 
PN u(1 — 62) —_ 82) PN-1 eae ay ay may PN-k 
1—@ 
oe ime. YN-k-1 = Ey, (64) 
with N=k+1, +2, --- and 0<@<1, 0O<@&<1, and 


0 < uw S 1 the parameter ranges. 

Every recursion derived in this paper can be put into the canonical 
form (64) by simple manipulations; furthermore, all but the recursion 
(63), Section 4.2, correspond to the homogeneous form of (64), i.e., 
vy = 0. In formulas for infinitely long burst (Sections 3.1, 3.2, 3.3) 
the parameter » = 1; in formulas for geometrically distributed bursts 
(Sections 4.1, 4.2)0 <<w=p<l. 

Due to the linear, time-independent nature of the recursions in (64), 
the behavior of the solutions is determined by the sequence {éy} and 
the roots, A;, of the characteristic polynomial : 


C(d, w) & w(L — 62)Met! — AF — (1 — 01 — 02) + w(1 — 6). (65) 


For the special case » = 1 the relevant properties of the roots were 
derived in Ref. 2. Here we derive the properties for arbitrary » in the 
range 0 < » <1. These properties are summarized in the following 


Lemma: For the range of parameters specified above, (a) C(A, nu) has ex- 
actly two positive real zeros \; and dz which lie in the ranges [u(1 — 6) J 
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- 
wat 


—— 





Fig. 2—Proof of lemma. 


< (Ay)* Sw and 1/p S de < 1/n(1 — 62) (the equality signs are un- 


“ 


necessary unless » = 1); (b)* the remaining zeros all satisfy |d;|* < p. 


Proof: (a) Regardless of the sign of (1 — 61 — 62) there are two sign 
reversals in the coefficients of C(\, »). By Descartes’ rule, therefore, 
C(x, #) has at most two positive real zeros. On the other hand, suc- 
cessively setting \ = 0, A* = u(1—6:), *® =n, AX\=1/p, A}=1/ 
u(1 — 62) we find that C(A, w) takes on the respective values u(1 — 41), 
w?0,02[ 4(1 —81) J!*, —pOi1(1—pt)!*), —65(u-*—p), and p6192/(1—6:2). 
Also C(\, nw) ~ +% ast —>+o. For 0 < u < 1, therefore, there are 
exactly two zeros in the respective ranges asserted. For » = 1 further 
examination is required to decide whether one or both of these zeros 
become exactly equal to 1. Noticing that C(1, 1) = 0 and (0/0A) 
C(1, 1) = 01 — k62, it follows that when p = 1, either 1 or 2 or both 
become equal to 1 according as 0, — k@. <0, >0, or =0. (b) We 
will prove the stronger result that the remaining zeros lie strictly 
within the contour T (Fig. 2) defincd by the following segments in the 


complex A plane: 


y 


\ = Reins) OSRS pik (66a) 
= pllkeit 4 <0<29n- ; (66b) 
= Re-s@rn/k) O<R KS pe. (66c) 


To prove this let us define 


I> 


Cy wAL(L — 62)A* — w(1 — 01 — 62) ] (67a) 
C. 2 * — wd — 61) (67b) 


*We are tacitly assuming k > 1. For k = 1, C(A, u) becomes a quadratic with 
both roots positive and real in the ranges given in (a). 
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so that 
CA, w) = Ci — C2 


s6,( 2-3), 68) 


We will show that Re[C,/C. — 1] < 0 for all \ on the contour I. 
Then by an obvious modification of Rouche’s theorem,” it follows that 
C(A, w) and C, each have the same number of zeros within I. As C, 
has k — 1 zeros within [, this proves the lemma. 

To show that Re (Ci/C2, — 1) < 0 for all \ on I, let us consider 
separately the circular are defined by (66b) and the radial lines defined 
by (66a) and (66c). 


(¢) On the circular are (66b) straightforward manipulation gives 


|C1|? zt petelk | C’, |2 
= 20ou? (2 = 61 = 02) (1 — Cos ké) < 0. (69) 


For wp < 1, therefore, |Ci/C:| < 1, hence Re (Ci/C, — 1) < 0. 
If » = 1, this argument remains valid except at points where 
cos k@ = 1, for then |Ci/C,| = 1. However, if cos k@ = 1 and 
m= 1, we find that Ci/C. — 1 = e — 1, whose real part 
<O for r/k S$ 0 S 2x — w/k. 

(iz) On the radial lines (66a) and (66c), 


Cy 


= uR (a = 65 HO10» 


~ RF¥+ y(1 — A) 


which is obviously <0 for R* S up. 


) cos 7 — 1, (70) 


All the recursions of this paper except (63) correspond to the homo- 
geneous form of (64), i.e., &; = 0. Solutions of all such recursions are of 
the form 


k 
gv = 2 Bid (71) 


and therefore the asymptotic behavior is governed by 2, and Ai 
when it is equal to 1. (In the special case p = 1 and 6; = k62, the 
dominant root is repeated and the usual modification must be made.) 
Dropping the subscript 7 from \; and 8; we give below an expression for 
the latter in terms of the initial conditions of the recursion, namely 
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(Yo, PO Pk): 





_ Bab. CG) = 1 Bet Wins: Oe 
p= «| ea qoay et : wit} Let]. ey) 


where 
= p(1 — 2)A**1/[TAF + kp? (1 — 01 — 02) — (k + 1)p(1 — 62)].  (72b) 


Thus, for example, the recursion for the probability of overflow 
[eq. (58) ], with 7 = 0, in the canonical form (64) has the initial 
conditions go = 1, gi = 1/[p(1 — 62) ]*, 7 = 1, 2, ---, k. Also, in this 
case the dominant root of the characteristic polynonial 2 is the only 
root outside the unit circle in the complex plane. Therefore, 


1 


where £ is obtained from (72) for the appropriate values of go, +++, ox. 
It can be easily shown that 6 > 0. In (73) (and similarly throughout 
this section) we use the notation 1/4“) ~ B\¥ to mean that |1/@G™ 
—pr’| < e%, for sufficiently large N, and e < 1. 

In a manner similar to the derivation of (73) we can show that the 
probability of a transmission fault (Section 3.2) has the following 
asymptotic behavior 


61 + 02 
set AN 
ayA3 + 6; a kK. 





_ when 6, < k0» (74a) 


-- 


Fr 9a (ANT + 1) 1— 6, -— 8 _L A. 

avegysr 1 Vv 
~~ —_—_—_— 

= al 


a) a Oi z =e 
05 + ———__— er, ee’ a _ 0; Siege when 0, = k@, (74b) 





A; 
6, + 82 
0, — kbs when 0; > k6>. (74c) 


In (74), a: is obtained from the generic formula (72a). We have shown 
that a; > 0 and, of course, 1 < A, < 1/1 — @. Likewise, the mean 
first passage time (see Section 3.3) is, asymptotically, 


FO) ~ aod os 1 + 62) N a2 > 0, when 0; < k6. 


kO, ——) 64 : 
_ (81 + O2)N? _ 
FC=t= ty RN 
~ (61 + 62)N + Qs, when 6; > k6>. 
6, pes k62 


444 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1974 


Finally, 
H™ wag + aN, a; > 0. (76) 


Vi. COMPUTATIONS 


We have written computer programs to recursively compute the 
quantities 7%), F%), @™), H™) as functions of N for specified values 
of 61, 92, k, and p. Figures 3 through 6 are sample illustrations generated 
by these programs for 6; = 0.2 and 62 = 0.1. The asymptotic behavior 


(N) 
LOG,,6 





0 20 40 60 80 100 120 140 
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160 180 200 89220 


Fig. 3—Probability of overflow in a burst vs level (6: = 0.2, 6. = 0.1, k = 5). 
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Fig. igen time for first passage conditional on overflow vs level (@: = 0.2, 
a. = 0.1, k = 5). 
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Fig. 5—Steady-state probability of transmission fault vs buffer size (6, = 0.2, 
02 = 0 1 . 


N 
LOG, FIN) 
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Fig. 6—Mean time for first passage in infinitely long bursts (6: = 0.2, 6, = 0.1). 


of the various quantities is seen to be in accord with that given by 
eqs. (73)—(76) of the previous section. The dependence on the param- 
eters p and k also is intuitively reasonable. 

APPENDIX 


(a) We prove the assertion made in the text [immediately following 
eq. (37) ] that the eigenvalues of the matrix (16) all lie strictly within 
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the unit circle. Let 


aA (1 = 6,)B ae AI O,A 
sa | 6,B (1 — @)A — XI ie se 


where I is the identity matrix of order N + 1. Then we must show that 
detM #0, for |A| = 1. (78) 


From the defining equations (14) and (18), we notice that the last 
column of A is identically zero. Thus 


det M = — det M’, (79) 


where M’ is obtained from M by deleting its last row and column. Let 
mi;,t, 7 = 0, +++, 2N + 1, denote the elements of M’. Then a theorem 
of Hadamard" states that det M’ ¥ 0 provided M’ is irreducible and 
2N+1 
|mj3| 2P;= DY |mal, (80) 
1=0 1Aj 
for all 7, with strict inequality for at least one j. The irreducibility 
condition as stated in Ref. 11 is satisfied. To show (80) we note that 


|m,,;=|A| and P, = 1, forj=1,---,2N +1, (81) 
and 
|moo| =|1 — 61 — Al, Po = 4. (82) 


Thus except at \ = 1, we find that (80) is true with strict inequality 
for 7 = 0. This proves the assertion (78) except for the point A = 1. 
However, for \ = 1, det M = 6,(1 — 62)% which, by assumption, ~0. 

(b) Following eq. (52) we made the assertion that I — pC — p?D is 
a nonsingular matrix for p S$ 1. The proof is as follows. Let 


A My Mie 
eae Fe ral 83) 


with My, = (1 — 6:)B — AI, Ma; = 4:B, etc. As Mz: commutes with 
Mu, an identity of Schur” states that 


det M = det [MiiMo. = MaM»y |. (84) 


However, straightforward manipulation of the right side of (84) shows 
that 
det M = det (27I — AC — D). (85) 


Then the assertion follows from (78). 
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Although coherent phase-shift keying (CPSK) is an efficient means of 
transmitting digital signals over carrier systems, it has not enjoyed wide- 
spread use at microwave and millimeter wavelengths because of the 
difficulty of recovering an accurate reference carrier for coherent detection. 

In this paper, a system is described which requires only a narrow-band 
phase-locked-oscillator filter for reference carrier recovery. This 1s ac- 
complished by block-coding and decoding the pulse sequence at the ter- 
minals; the recovery of a baseband timing wave is also facilitated by the 
coding process. It is also shown that: (2) for an arbitrary random input 
sequence, accurate carrier recovery cannot be achieved with just a narrow- 
band filter, (it) for the system described, any input pulse sequence 1s 
acceptable, and (iti) there is a maximum error in the phase of the re- 
covered reference carrier which can be controlled by choosing the number 
of pulses in the coding block and the bandwidth of the recovery filter. 


I. INTRODUCTION 


Coherent phase-shift keying (CPSK) is one of the most efficient 
means of modulation for the transmission of digital information over 
carrier systems. In particular, CPSK is at least as efficient as frequency- 
shift keying or differentially coherent phase-shift keying.!-> Equally 
important from the point of view of hardware realization, CPSK is 
suited to operation with amplifiers which operate most efficiently in a 
nonlinear regime; this class of amplifiers includes those using traveling- 
wave tubes, varactor up-converters, and tunnel or IMPATT diodes 
used as power amplifiers or as injection-locked oscillator amplifiers. 

Traveling-wave-tube amplifiers have been proposed for use in 
satellite repeaters, and there is considerable current work directed 
toward the application of millimeter-wave integrated circuit injection- 
locked oscillator amplifiers in digital radio and waveguide transmis- 
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sion systems.‘“!* The CPSK method described here is suitable for 
those applications. 

For the type of operation envisaged for these systems, the statistics 
of the digital sources are usually unknown. To achieve maximum 
operational flexibility, it was assumed at the outset that the system 
must operate with any input pulse sequence. With this arrangement, 
the statistics of the signal source need not be restricted. 

In the system to be described, the recovery of the reference carrier 
phase—a major problem in CPSK transmission—is accomplished with 
the aid of a block-coding and decoding of the pulse sequence at the 
terminals. This coding allows recovery of the reference carrier with a 
narrow-band phase-locked-oscillator filter. 

Another important problem in digital systems with unrestricted pulse 
sequences is the recovery of the timing wave for use in the regeneration 
process. The same coding process which affords reference carrier 
recovery for all sequences also assures timing wave recovery for all 
sequences. 

In this paper, the block-coding, the reference carrier recovery, and 
the timing wave recovery are described for binary and multilevel 
CPSK systems. 


Il. COHERENT DIGITAL PHASE MODULATION 
2.1 The baseband and modulated carrier signal formats 

A diagram of the block-coded CPSK carrier system is shown in Fig. 1. 
Ignoring the block coder for the moment, the input to the radio system 
is a baseband sequence of discrete amplitudes as illustrated by a binary 


sequence of ones and zeros in Fie, 29a. The ones are eodad as nositive 


MALY Y YVR VEEUN WILE HULU SLE B Spe ae BAA Ys WA VY MUM BON PU NauA 


pulses and the zeros as negative pulses as shown in Fig. 2b; this se- 
quence is the input to the phase modulator. Although raised-cosine 
pulses are used for illustration, other pulse shapes can also be used. 

An m-level baseband sequence of raised-cosine pulses shown in 
Figure 2b is written 


v(t) = Vo d anp(t — nT), (1) 
where T' is the pulse interval, V, the peak pulse amplitude, a, = + 1, 
and 

1 2nt T 

on ee: 

5(1 + cos =), tl 35 

p(t) = 
T 
0, |t| > x 
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Fig. 1—Block-coded CPSK carrier terminals. 


TRANSMITTER 





BLOCK 
DECODER 


















RECEIVER 


This baseband signal is used to phase modulate a sinusoidal carrier. The 
output of the phase modulator is 


M(t) = A. cos [wet + D> anp(t — nT)], (2) 


where a, = kr/m,k = +1, +3,---, 4(m — 1). The pulse sequence 
of Fig. 2b represents both the baseband signal voltage of (1) and the 
phase modulation in (2). The peak baseband voltage V, produces a 
peak phase deviation of 7/2 radians for the binary case illustrated. 

A vector representation of the modulated signal is shown in Fig. 3a. 
The carrier amplitude is A. and the unmodulated phase of the carrier 


1 1 0 0 1 1 1 1 0 1 0 0 


(a) BASEBAND PULSE SEQUENCE 
AT INPUT TO RADIO SYSTEM 
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Fig. 2—Binary polar signal format. 
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Fig. 3—Phase plane representation of a binary signal. 


is zero. When pulses modulate the phase of the carrier the amplitude 
remains constant and the phase follows the modulating signal voltage. 
Trajectories for the positive and negative raised-cosine pulses of Fig. 
3b are shown in Fig. 3a for the binary case. 

It is worth noting that double-sideband suppressed-carrier modu- 
lators and switched delay-line modulators are sometimes regarded as 
phase modulators. A justification for this interpretation is that the 
pulses are sampled at the receiver only when the phase is at the peak 
value. However, they differ from phase modulators in the amount of 
amplitude modulation that is generated. The trajectory of the double- 
sideband suppressed-carrier modulation is the vertical axis in Fig. 3a 
between the points A and B; the trajectory of the switched delay-line 
modulation may be intermediate between the vertical trajectory and 
the circular trajectory of a phase modulator. Since future systems are 
expected to have power amplifiers operating in the region of saturation, 
the distortion caused by large variations in amplitude can be avoided 
by restricting consideration to phase modulators. Phase modulators 
suitable for this purpose are described elsewhere. 


2.2 A description of coherent phase detection 


Let the input to the phase detector of the receiver in Fig. 1 be the 
phase-modulated signal M(t). The phase detector requires a local 
reference signal with the proper phase. This reference signal is written 


R(t) = — 2Arsin [wt + e(t)], (3) 
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where e(¢) is any error in the reference phase. The output of the phase 
detector is the low-frequency part, Vr(t), of the product of the input 
signal and the reference signal. 


Vet) = AcArsinL> anp(t — nT) — e(é)]. (4) 


If the phase error, e(t), is zero, and if the output of the phase detector 
is sampled at times ¢ = nT, the output will be +AcApr accordingly 
as dn = + 1 and the transmitted pulse sequence is recovered. 

If the reference phase error is not zero, the signal output amplitude 
will be reduced by the factor cos «. For example, if « = 7/4, the base- 
band pulse amplitude will be reduced 3 dB. An important function of 
the system to be described is to recover the reference phase in such a 
manner that e(¢) is small. 


lil. REFERENCE CARRIER RECOVERY WITH A PHASE-LOCKED OSCILLATOR 


The reference carrier recovery filter is assumed to be a phase-locked 
oscillator with a locking bandwidth much smaller than the bandwidth 
of the modulating pulse sequence. The analysis presented here applies 
to an injection-locked oscillator or a first-order phase-locked loop; 
the noiseless case will be considered. * 

Let the input signal be 


M(t) = A. cos [we + 6(0) ]. (5) 

The differential equation describing the locking behavior of a 

negative resistance sine-wave oscillator has been derived in several 
forms.!7—!8 With the present notation the equation is 


W109 = (w, ~ w) — Asin [e(t) ~ 001, (6) 


where w. is the unlocked oscillator frequency, |w. — w.| <w., 2A is 
the locking bandwidth, and e(t) is the reference phase error. 

Since the oscillator is being used to recover the reference phase, the 
phase error, e(¢), should be as small as possible. For this reason it is 
necessary that the locking bandwidth of the locked oscillator be much 
smaller than the bandwidth of the signal, @(¢). This assumption, in its 
most useful form, means that AT <« 1. Following Adler, expression (6) 
is rearranged as follows: 


de(t) 
dt 





= AK — Asin[e(t) — 0(t)], (7) 





* Eisenberg has presented a related analysis which includes additive thermal noise.!® 
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where K = (w, — w.)/A. The term K represents any initial difference 
between the free-running frequency of the oscillator and the input 
frequency ; in the region of interest |K| < 1. 

We are interested in deriving an unambiguous reference carrier for 
multilevel digital modulation in which each pulse is time-limited to a 
single interval of duration T’. 


6(t) = LX anp(t — nT’). (8) 


During a single pulse the variation in 6(¢) is much larger than the 
variation in e(t) because AT < 1. Therefore, the phase error at the end 
of the nth pulse can be found by integrating (7) over the nth pulse with 
e(t) held constant at the value of the phase error at the beginning of 
the nth pulse. Writing «, = e(nT), we have, from (7) and (8), 


(n-+1)T7 f 
Enzi — €n = a [ {K — sin [en — @(t) ]}dt 


T 


7 cos 6(t)dt 


-ar|x- 


GOS En (n+1)T 


Each pulse is nonzero in a single interval of duration T so we have 
(n-+1)T (n-H1)T . 
J cos 6(t)dt =f cos anp(t — nT)dt = cos|an|p(x)dz, 
nT nT 0 
and 


(ntl) T (nt1)T 
J sin 6(t)di = i, sina,p(t — nT)dt 
nT nT 


an 


= asl fe sin |an|p(x)da. 





Simplifying the notation, we write 


T 
On = An/|anl, Can f cos|an|p(x)dx, 
T Jo 


and 
| ee eee 
S,.= 7H sin|an|p(x)da. (10) 
T Jo 
Expression (9) becomes 
€ntt — €n = ATLK — C, sin en + 02Sn COS En |. (11) 


As shown in (10), C, and S, are functions of the shape of the pulse. 
For the digital signal described by (8) the peak deviation is less than 7 
radians for any number of levels and, for the class of pulse shapes of 
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interest, S, is positive. This is not true of C,—it can be positive, nega- 
tive, or zero. Hisenberg’® has derived C and S for several pulse shapes 
of interest. The sign of C, has an important effect upon the phase of 
the recovered reference carrier; in order that the recovered reference 
carrier have an unambiguous phase near zero degrees, it will be shown 
that a pulse shape must be used for which C, > 0. 

For the binary case, (11) can be written 


ctl — fn _ K — VO? + Sin («. — dn tan ). (12) 


AT 
Let the probability that b, = +1 be p and the probability that 
b, = — lbe (1 — p=). The average phase, ¢,, will be such that the error 


due to = positive pulses is equal in amplitude and opposite in sign from 
the error due to (1 — p) negative pulses. From (12) we get 


p | K - WOFF Ssin (c, — tan )] 
a .2(f= p)| K — VC? + S? sin (« + tan“ =) | ) 


and solving for the average phase, we get 


ee ee eee: ene 
VC? + 2p — 128 


Under the best circuit adjustment, K = 0 and the average phase error 
is given by the second term in (13). When C is positive, the average 
phase is in the first or fourth quadrant and when p = 3 the average 
phase is zero. On the other hand, when C is negative, the average phase 
is in the second or third quadrant and when p = 3 the average phase 
is 7. 

A switch of the reference carrier phase from near zero to near 7 can 
happen in a multilevel system. Consider a 4-level system with rec- 
tangular pulses and peak phase deviations of +2/4 and +37/4. 
Suppose that for a time the pulses alternate between +7/4. From (10), 
C = cos 7/4 = 1/v2 > 0 and the average phase is zero. If the pulse 
sequence then changes to alternate between +37/4, C = cos 37/4 
= — 1/v2 < 0 and the average phase becomes z. This very undesirable 
situation can be avoided by using pulse shapes for which C > 0. In 
the rest of this paper we assume that, in all cases, a pulse shape is 
chosen for which C, > 0. 

It is highly desirable that the average phase of the reference carrier 
be near zero. This means that in addition to requiring that C, > 0, it 


a tant (O51) .. (13) 
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is also necessary that K 0 and p34 as may be seen from (13). 
The parameter K can be kept near zero by setting the rest frequency of 
the phase-locked oscillator equal to the signal carrier frequency. The 
system is required to operate with any input sequence so it is unlikely 
that p will always be near one-half. The input sequence can be coded— 
by a block coder to be described in Section V—into a transmitted 
sequence with p = 3, thus insuring tan (2p — 1)S/C 30. Even 
with coding, however, the reference phase will fluctuate about zero 
and it is necesssary to insure that these fluctuations do not cause 
substantial degradation in performance relative to the performance 
which would be obtained with a perfect reference carrier. It has often 
been thought that the problem in the recovery of reference phase is 
that the occurrence of long sequences of identical pulses drives the 
recovered phase beyond reasonable limits and that if the sequences of 
pulses were sufficiently random this problem would go away. Random 
sequences are therefore of great interest. In the next section the vari- 
ance of the reference phase error is derived and the results illustrated 
by an example. 


IV. REFERENCE CARRIER PHASE ERROR FOR RANDOM SEQUENCES 


The differential equation which describes the phase-locked oscillator 
is nonlinear and therefore difficult to solve. We begin by noting that 
(7) can be solved exactly on a pulse-by-pulse basis if the pulses are 
rectangular. For pulses with other shapes an equivalent rectangular 
pulse can be derived. Then, linearizing the equation, and recognizing 
that the phase error is approximately normally distributed, the vari- 
ance can be estimated for the equivalent rectangular pulses. The binary 
case is considered. 

Let the pulses be rectangular with peak deviation +6,. Then, re- 
arranging (7) and setting 0(¢) = 6,, we have 


(n41)T de(t) = i = 
2 —K+sin[e(t) — 6,] Jar a 


The solution to the integral on the left can be found in many tables of 
integrals. After some algebra the result can be written 


Rtan *5** + (K — tan *5** ) tanh (5 2) 


ee 
B+ (1 — Ktan 5" ) tanh ( ) 








x (14) 
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where en = e[(n +1)T] and R = v1 — K®. Equation (14) is exact 
for rectangular pulses and, if an input sequence of rectangular pulses 
is specified, the exact phase error can be computed. We will estimate 
the variance for a linearized version of (14). When K = 0, (14) can 
be written 





€n41 = 8, + 2 tan (ee tan = - a ) : (15) 


Approximating the tangent by its argument, 


Applying (16) repeatedly we get 


; k-1 
Ene FE ene KAT + (1 — € AT) SS Ong me AMAT, 
m=0 
When k is large, the phase error is independent of n and becomes 


k—1 
6, (1 — eT) & One7™47, (17) 
m=0 
where the pulses have been rearranged to simplify the notation. 
From (12) it may be seen that the error due to a shaped pulse when 
én & 0 is given by 
eR ATV + S sin (tan =) . (18) 
where a pulse of positive polarity is assumed. Comparison of (18) 
with the error in (17) due to the pulse 6, suggests that the equivalent 
rectangular pulse is obtained by letting 


Om = OavC? + §? sin (tan) (19) 
in (17) where bm = + 1 with probability p, b,, = — 1 with probability 


(1 — p). 
The variance of (17) can be found by straightforward means; 
the mean, from (13), and the variance for shaped pulses are: 


pu & tan! (2p — 1) . 
oe (20) 
o2 2 CFS sin (tan ) | AT ol = >): 


The reference error is approximately normally distributed and the 
probability that the reference phase error will exceed a specified value 
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€, 1872 








Pelz a Q(S—")+Q(25"). (21) 
o o 

These results will be illustrated by an example. Let the pulse rate 
be 100 megabits per second and the locking bandwidth 0.5 MHz. For 
raised-cosine pulses with a peak deviation +7/2, 


ue tan! (2p — 1) 
o & 2ZATS*p(1 — p) 


with S = 0.6021947. For p = 4, » = 0 and « % 0.0213 radian rms. 
The fraction of time that the phase error exceeds 0.1 radian is 


P(\e| = 0.1) & 2Q(4.697) = 2.75 X 10°. 


In some applications, changes in pulse pattern density may occur which 
are reflected in fluctuations in p. In this event the probability of a pulse 
being positive will not be constant at p = 4 but will wander slowly 
about this value. Suppose, in the foregoing example, p increases by 
five percent from p = 4 to p = 0.525. Then, wu = 0.05 and 


P(\e| = 0.1) & Q(0.235) + Q(7.04) = 0.41. 
Two important conclusions are illustrated by this example. 


(t) For practical circuit parameters, the probability of exceeding 
reasonable phase errors is uncomfortably large even for well- 
behaved random input sequences. 

(17) Small variations in pulse pattern density can cause large 
increases in the probability of exceeding reasonable phase 
errors. 


It should be understood that the above example is but one of many 
possible examples; however, the filter bandwidth assumed is a practical 
value for systems operating at millimeter wavelengths. It should also 
be noted that the pulse sequences assumed in the example are highly 
idealized and may or may not approximate sequences from real 
sources. 


V. THE BINARY BLOCK CODER 


The coder described in this section is a digital adaptation of a coder 
invented by F. K. Bowers.” 

The operation of the block coder will be described with the aid of 
Fig. 4. The block counter is an up/down counter which counts each 
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Fig. 4—Binary coder. 


successive block of M pulses in the input sequence and indicates on its 
output terminal whether that block contains more positive than nega- 
tive pulses. M is an even integer. The output counter is also an up/ 
down counter which counts all pulses transmitted and indicates whether 
a surplus of positive or negative pulses has been transmitted since the 
start of transmission. The outputs of the two counters are used in the 
decision circuit to invert or not invert the block of M pulses just 
counted, the decision always being made to equalize the number of 
positive and negative pulses transmitted. 

In addition to the framing pulses, a coding pulse is added to each 
block of M pulses and is used in the receiver to re-invert those blocks 
which were inverted at the transmitter. Figure 4 shows the operations 
of a block coder on a sequence of binary input pulses. In this configura- 
tion a framing pulse and a coding pulse have been added to each 
block of M input pulses. While a coding pulse is necessary for each 
block of M input pulses, fewer framing pulses can be used if desired. 

The decoding process at the receiving terminal is illustrated in Fig. 5. 
The position of the coding pulses in the input sequence are known 
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Fig. 5—Binary decoder. 
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relative to the position of the framing pulse. When framing is estab- 
lished, the coding pulses can be detected and the proper block in- 
versions made so that the output sequence will be identical to the input 
sequence at the transmitting terminal. 

A detailed analysis of the coding operation reveals the following 
results for M even. 


(t) The output count, and hence the sum of the output sequence, 
cannot exceed +(1 + 3M/2). 

(77) At the end of each frame of M + 1 output pulses the output 
count cannot exceed +(M + 1); whatever the count at the 
end of a frame, the count at the end of the next frame will 
have moved in the direction of zero by a count of at least one. 

(4a) The maximum number of pulses between a zero in the output 
counter and the next zero is (M + 1)(M + 2). 

(iv) The maximum number of identical pulses is 2 + 5M/2 and the 
output count at the end of such a sequence is +(1 + 3M/2). 


In deriving these properties it is necessary to adopt a convention as 
to the output indicated by the output counter when the count is zero. 
If the count approached zero from the negative side it will indicate 
that a surplus of negative pulses has been sent and the converse is 
true if the zero count is approached from the positive side. Suppose 
the output counter indicates that a surplus of positive pulses has been 
transmitted. The coding pulse is counted as a positive pulse at the 
input counter making an odd number of pulses counted. At the end 
of the frame the input counter indicates that a surplus of positive or 
negative pulses is contained in the block. The block is inverted or not 
so that the output count goes toward zero. Since there is always a 
surplus of at least one pulse in each block, the output counter counts 
toward zero at least one count at the end of every frame; the output 
count can pass through zero in this process. Now suppose that the 
output count is zero and that this count was approached from the 
negative side. The output counter indicates that a surplus of negative 
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pulses has been transmitted. If the next block has all positive pulses, 
the block will not be inverted and the output count will go to (M + 1). 
This is the maximum count which can occur at the end of a frame since 
it has already been shown that at the end of the next frame the count 
must go toward zero by at least one. Property (77) has therefore been 
demonstrated. 

The example can be continued to demonstrate property (2). Let 
the count be (M + 1) at the end of a frame. At the end of the next 
frame the count cannot exceed M so that the maximum number of 
positive pulses that can be added to the count during the frame is 
M/2. Thus, the maximum count is M+ 1+ M/2=1+3M/2 and 
this is property (2). 

The maximum number of pulses between zeros of the output count 
is found by achieving the maximum count of (M + 1) in the first 
block and reducing the count by the minimum of one in successive 
blocks until zero is reached. There are just (M + 2) blocks necessary 
to reach the next zero and (M + 1) pulses per block so the maximum 
number of pulses between zeros is (M + 1)(M + 2). This is property 
(iit). Finally property (iv) is achieved by letting the count at the end 
of a frame be — (M + 1). The next frame has all pulses positive which 
brings the output counter to zero from the negative direction. The 
next (1 + 3M/2) pulses can be positive bringing the total number of 
successive positive pulses to (M+ 1)+ (1+ 3M/2) =2+5M/2 
as stated. 

When the coder is in operation the transmitted sequence contains 
equal numbers of positive and negative pulses. The resulting average 
phase is given by (13) with p = 3. 

€> = sin“ Zs : 
7 C 
The fluctuations about ¢, can be determined from (17). The number 
of pulses between zeros is (M + 1)(M + 2) and if AT is sufficiently 
small that (M + 1)(M + 2)AT <« 1, the exponential terms in (17) are 
approximately unity. Then, since the maximum sum of the output se- 
quence is 1 + 3M/2, an upper bound on the phase error results. 





eee iene ear (1 Pees ) On, (22) 
C 2 
where @,, is the equivalent rectangular pulse as given in (19). 
A graphic example of the effect of coding is given in Fig. 6. A se- 


quence of 200 random pulses from the Rand table of random numbers 
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Fig. 6—The effect of coding a random sequence. 
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is shown at the bottom of the figure.?* The sum of these digits, >>, bn, 
is the upper plot. Note that although there are many transitions be- 
tween positive and negative pulses the sum remains above zero most 
of the time. The slow drift of this sum illustrates the manner in which 
the phase error wanders. 

The same input pulse sequence is shown after coding in a block coder 
with M = 8. The framing and coding pulses are not present. In the 
uncoded sequence the maximum error for AJ = 0.01 is 0.36 radian 
(20.6 degrees) for rectangular pulses with 7/2 radian deviation, 
whereas the maximum phase error in the coded sequence is 0.06 
radian (3.4 degrees). 

The original sequence in Fig. 6 is not a rare case. As shown, >>» bn 
reaches 36 and the probability of this is 


Pavo(|X Dal & 36) 2 (se | 
ry 2Q(2.54) = 0.011. 


Thus, about one out of a hundred sequences of 200 pulses each has a 
sum at least as great as the one shown in Fig. 6. 

The price paid for the recovery of the reference carrier with a small 
phase error is an increase in the transmission rate by the factor 
(M + 1)/M. 


Vl. TIMING WAVE RECOVERY 


It has been shown that, by coding the transmitted pulse sequence, 
the reference carrier can be recovered accurately. As shown in Fig. 1, 
the reference carrier is used to drive the phase detector in which the 
baseband pulse sequence is recovered. In a self-timed system, such as 
the one depicted in Fig. 1, it is necessary to recover a timing wave for 
use in regenerating the pulse sequence. Block-coding also helps in this 
process. 

Bennett has shown that a timing wave can be recovered by suitable 
nonlinear operations even if a spectral line at the timing frequency 
does not exist; the method requires a suitable number of transitions 
between signal polarities.*4 But the block coding discussed in Section V 
insures that the largest number of pulses between signal transitions 
is 2+ 5M/2. Therefore, the block coding insures the recovery of 
both the reference carrier and the timing wave for any sequence of 
pulses whatever. 

For the sequence of pulses shown in Fig. 6 it is instructive to note 
that, although the sum }_, b, fails to cross the axis for a string of 186 
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pulses, there are frequent transitions between signal states and for 
each transition the timing recovery filter will receive a timing pulse.” 
There are 98 transitions in all and the maximum number of identical 
pulses between transitions is six. This is typical of the behavior of 
random sequences and is the reason that the recovery of the reference 
phase is usually more difficult than the recovery of the timing wave. 


Vil. MULTILEVEL BLOCK-CODED CPSK 


The binary coding scheme described in Section V can be extended to 
4, 8, 16, and higher numbers of levels. In each case the coder operates 
to equalize the numbers of pulses with equal amplitudes and conjugate 
phases. For example, the 4-level coder illustrated in Fig. 7 equalizes 
the numbers of pulses with 7/4 radian peak deviation and opposite 
signs, and equalizes the numbers of pulses with 37/4 radian peak 
deviation and opposite signs. The 4-level decoder is shown schemati- 
cally in Fig. 8. 

Because the multilevel coder equalizes the numbers of positive and 
negative pulses for each pair of levels the computation of bounds on 
the phase error reduces to the binary case. A bound can be computed 
for each pair of levels and the largest bound applies; this will usually 
be the bound computed for the pair of levels with the largest deviation. 


BLOCK 
COUNTER 


PHASE 
SELECTOR 


















OUTPUT 
DECISION COUNTER 
+135° 


















FRAMING 
AND 



















+135° CODING 
CHANNEL A a 
INPUT INVERT 4-LEVEL 
a OR 4—LEVEL | OUTPUT 
NOT COMBINER 
CHANNEL B INVERT 
INPUT 





PHASE FRAMING 
SELECTOR AND 
+45° CODING - 






OUTPUT 
COUNTER 
*45° 


DECISION 





BLOCK 
COUNTER 


464 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1974 


Fig. 7—Four-phase block encoder. 
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Fig. 8—Four-phase block decoder. 


Vill. CONCLUSION 


The CPSK system which results from block-coding the input digital 
sequence as described in this paper has the following properties: 


(t) The system places no restrictions on the pulse sequence ac- 
cepted from the source; any sequence whatever can be 
transmitted. 

(ii) Recovery of the reference carrier at repeater points is ac- 
complished with a narrow-band filter. 

(cit) A timing wave can be recovered for any sequence of pulses. 
(tv) Any pulse shaping required can be done at baseband. 

(v) The phase-modulated carrier is suited to operation with 
nonlinear amplifiers; in some applications RF filters are not 
required to shape the spectrum. 


The costs of providing these features are: 


(i) A block coder must be supplied at the transmitting terminal 
and a decoder at the receiving terminal. 

(77) The transmission rate is increased by the factor (MZ + 1)/M 
where M is the number of pulses in the coding block. In 
principle, M@ can be very large; in practice, it will be limited by 
the frequency stabilities of the RF oscillators used in the 
system. 

(cit) The error rate is increased by the factor 2(1 — P.) because 
an error in a coding pulse causes M errors in the signal se- 
quence. This increase in error rate is of little practical im- 
portance. 
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Controlled Current Filaments in PNIPN 
Structures With Application to 
Magnetic Field Detection 


By G. PERSKY and D. J. BARTELINK 
(Manuscript received August 20, 1973) 


Stable biasing of multiterminal PNIPN structures to support con- 
trolled current filaments is proposed. A filament forms when base layer 
spreading resistance is sufficiently high for lateral base voltage drops to 
shut off injection at all but a small interior portion of the structure. For 
elongated parallel stripe emitter-base configurations, application of a 
magnetic field normal to the current filament and stripe axes results in 
lateral displacement of the filament which is detectable through a change 
in the external circuit current flow pattern. This displacement can be 
significantly larger than that of a single-pass Hall deflection, yrelding 
high sensitivity. Analysis of an tdeal model confirms a substantial 1m- 
provement in performance over that of conventional Hall devices, viz., 
a manyfold increase in the ratio of short circuit signal current to drive 
current, similar improvement in signal-to-offset ratio, and controllable 
high output impedance making large signal voltages available. Solutions 
for the tdeal model are presented for carrier transport in the I region both 
without and with lateral diffusive spread. It 1s argued that departures of 
actual device behavior from this model are not apt to be important. Possible 
circurt connections and a sample calculation of parameter values for a 
realizable structure are also given. 


I. INTRODUCTION 


The purpose of this paper is to show how PNPN structures can be 
biased stably to support controlled current filaments and to describe 
a sensitive magnetic field detector utilizing this principle in a PNIPN 
structure. PNPN devices are widely used as 2-terminal bistable 
switches! and as 3- and 4-terminal controlled switches,’ and have also 
been utilized in 4-terminal operation as a linear amplifier.* The multi- 
terminal circuit operation of the PNIPN structure described here forces 
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nearly equal base and emitter currents and thereby suppresses these 
switching and amplifying effects. Stable filament formation properties 
are introduced when significant spreading resistance is incorporated 
in each base layer. For the operating conditions considered, the central 
junction remains in reverse bias and supports counterflowing con- 
fined streams of both electrons and holes. The shape and position 
of this filament are controlled by fully characterized device and cir- 
cuit parameters, in contrast with previously reported filamentary 
instabilities.‘ 

Magnetic field sensing is made possible because a magnetic field 
applied perpendicular to the filament displaces it laterally and thereby 
produces a signal in the external circuit. The displacement can be 
many times larger than the Hall displacement of either carrier species 
for a single transit of the I region. The I region is incorporated in the 
structure for the purpose of increasing filament length and hence its 
interaction with the magnetic field. The analysis will show that the 
sensitivity of the device can markedly exceed that of an ideal Hall 
effect detector of similar dimensions. Improved sensitivity is per- 
mitted because the compensating electron and hole streams prevent 
the buildup of a net Hall voltage. For moderate magnetic fields, 
detection is linear, yielding field polarity as well as magnitude. This 
behavior differs strongly from that of previously reported filamentary 
magnetic sensors in which detection is related to precipitous disrup- 
tion of the filament when the field reaches a sufficient magnitude.® 

Section II explains how stable multiterminal operation of the 
PNIPN structure can be achieved and how base resistance leads to 
the formation of a controlled current filament. An intuitive picture of 
the magnetic response is then developed. Sections IJI and IV present 
an analytical treatment of the filament characteristics and the magnetic 
response, respectively. Two cases are considered, transport in the 
intrinsic region without lateral spread and with diffusive spread. 
Section V assesses various effects that may cause actual device be- 
havior to depart from the ideal operation predicted in Sections III 
and IV, shows possible circuit connections for the device, and pre- 
sents theoretical performance characteristics for a realizable structure. 
Section VI summarizes the main features of the analysis. Preliminary 
experimental results are presented elsewhere.*® 


ll. GENERAL CONSIDERATIONS 


Figure la shows an elementary circuit which causes the emitter 
currents to equal the base currents in an idealized one-dimensional 
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Fig. 1—(a) Four-terminal connection of PNIPN structure. (b) Current loop with 
figure-8 configuration. (c) Terminal 1-2 I7 — Vr characteristics. 


symmetric PNIPN structure with infinite current gain in each emitter- 
base configuration. We assume in addition that there is no significant 
recombination in the I region and that electrons and holes have 
identical properties apart from the charge sign. The current, Ir, 
supplied by the constant current source in Fig. la, follows a figure-8 
path as shown in Fig. 1b. Upon entering emitter P., the current is 
injected as hole current through base N, and region I. It arrives on 
base P» where, as a stream of majority carriers, it can exit only through 
contact 4 to battery V,. Simultaneously, electrons are injected by 
emitter N, to arrive at Nz where, as majority carriers, their only path 
is to close the loop through contact 3. It is the direct external con- 
nection through battery V, that permits stable conduction of the 
current 2/7 in the I region. Interruption of this external current would 
force the central junction to become forward biased, corresponding to 
the ‘‘on’’ state of the switching mode. With battery V. in place, a 
typical terminal characteristic between contacts 1 and 2 is shown in 
Fig. 1c. It is single-valued and consists of the characteristic of a 
battery V, and two diodes, all connected in series. Clearly, any finite 
impedance source connected between these terminals will give de 
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stable operation. The fact that stable 4-terminal operation of PNPN 
devices is possible has recently been demonstrated. 

With the addition of a resistance FR in series with battery V., the 
voltage across the whole structure is reduced by I7R, causing the 
characteristic eventually to bend back into a negative resistance region 
as indicated by the dashed curve in Fig. 1c. Stable operation will then 
require a source impedance greater than Rf. Note that this type of 
voltage turnback is consistent with common-base current gain a 
maintained at or near unity for each emitter-base configuration, 
throughout the negative resistance portion of the characteristic except 
near zero voltage. We have operated a commercial 4-terminal PNPN 
device, as well as an Ebers equivalent pair of transistors, in this 
circuit and have observed a stable negative resistance as depicted 
in Fig. 1c. 

Formation of a stable current filament is brought about by base 
spreading resistance in the otherwise ideal PNIPN structure. The 
filament formation mechanism can be understood qualitatively with 
reference to the schematic illustration given in Fig. 2. We retain the 
assumption that the central diode is everywhere in reverse bias and 
that a = 1 for each emitter-base configuration. This structure is 
explicitly 2-dimensional, having a stripe geometry, and there is as- 
sumed to be no functional dependence on the third coordinate. With 
the end terminals of each base layer shorted together as shown, the 
current filament will locate itself along the center line of the structure. 
We now trace the temporal evolution toward this state, starting from 
an initial distribution of hole current which is assumed to be uniform. 
Upon arrival on P,, the hole current flow is divided between the base 
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Fig. 2—Filament forming structure with multiterminal circuit connection. 
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contacts and, consistent with uniform spreading resistance of the base, 
produces a parabolic voltage profile with its maximum at the center. 
The total injection of electrons from N, must correspond to a current 
Ir, but the base voltage profile will not permit this injection to be 
uniform. Because the base-emitter voltage is a maximum at the center 
and because the law governing electron injection is a highly nonlinear 
function of this voltage, the electron current density will peak sharply 
at the center. With only moderate lateral spreading in the I region, 
which is readily attainable,’ electrons arrive at N» with a distribution 
still peaked at the center. Since the average electron must now cross 
a greater length of resistive base than the average hole did in the uni- 
form distribution, a greater maximum base voltage will be developed 
and the voltage gradient at points away from the contacts will be 
enhanced. This sharper voltage profile will in turn lead to an injected 
hole distribution more sharply peaked than the incident electron dis- 
tribution. The analysis will show that, after a steady state is reached, 
the injected distribution of electrons and holes becomes identical. 
Because of the exponential injection law, this steady-state profile will 
become progressively sharper as Jr is increased. In particular, when 
the voltage from base center to base contact is 1 V, the ratio of 
current density at the center to that at the edge is exp (qV/kT) ~ e®. 
When the filament is highly localized at the center, it is clear that the 
base resistance acts very much like the resistor R in series with battery 
V. in Fig. la, and that negative resistance from terminals 1 to 2 in 
Fig. 2 will similarly result. 

The sharpest filament profile occurs when the I region is made ex- 
tremely thin to eliminate the lateral diffusive and/or space-charge 
spread. Although a thick I region is needed for good magnetic field 
sensitivity, previous work on confined electron beams in Si’ demon- 
strates that highly localized distributions of electrons and holes arriv- 
ing at the base layers can still be expected. Accordingly, in this paper 
it is assumed that space-charge spreading is negligible for reasons of 
low beam current or electron-hole charge compensation, and diffusion 
will be used to characterize the lateral spread. 

When a magnetic field is applied into the plane of Fig. 2, the filament 
will move some distance to the right of center, producing an observable 
current unbalance in the external circuit. Such bodily displacement of 
the filament is brought about by the Lorentz force, which by virtue 
of the counterstreaming motion of the electrons and holes causes a 
Hall displacement to the right for both carrier species. If there were 
no effects tending to return the filament to center, the interjection of a 
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unidirectional Hall displacement into each pass of the regenerative 
particle flow loop would translate the filament indefinitely to the right, 
in the manner depicted in Fig. 3a. 

However, when the filament is shifted off-center, a ‘‘restoring force”’ 
is produced. This force is proportional to the displacement of the 
filament from the center, while the Lorentz force remains constant. 
Therefore, an equilibrium position is attained for which the return 
injection maximum is displaced back toward center by an amount 
equaling the single-pass Hall displacement, as indicated in Fig. 3b. 
Further insight into the nature of this equilibrium state can be gained 
from a study of Fig. 3c, which illustrates the relationship between the 
arriving hole current distribution, J,(«), and voltage profile V(x) in 
base P». Since the distribution J (x) is displaced to the right of center, 
it sends more current to the right-hand contact than to the left-hand 
contact because the resistance is less looking to the right. The point 
in the J,(x) profile which divides the leftward from the rightward 
flowing currents must therefore lie to the left of the centroid of J (x). 
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Fig. 3—(a) Representation of multipass displacement in the absence of a restoring 
force. (b) Representation as in (a) with restoring force. (c) Illustration of relation 
between hole current profile J,(x) and base voltage V(x) for a displaced filament. 
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This division point is, of course, the point of maximum base voltage 
since it is the electric field in the base which causes current conduction 
toward the contacts. In the vicinity of each contact, far from the 
filament, the magnitude of the slope of the V,(x) curve must cor- 
respond to the total current at that contact. The ratio of these slopes 
is, for the present discussion, adequately characterized by the assump- 
tion that the straight-line extrapolations intersect at the centroid 
position x,, as depicted in the figure, and thus correspond to a ratio 
(L + x,.)/(L — x.). The leftward displacement of the maximum of 
V(x) from the J,(x) centroid is therefore determined by the require- 
ment that the areas under the J,(x) curve to the right and left of the 
division point be in the ratio (L + x,.)/(L — 2x,). The significance of the 
leftward displacement x, is that the return injection profile of the 
electrons peaks at the voltage maximum and is therefore displaced 
leftward from the centroid of the arriving distribution by this amount. 
Equilibrium occurs when z, is equal to the rightward single-pass Hall 
displacement xy. 

It is apparent that, to within the above approximations, filament 
displacement in the magnetic field must be linear since x, « xq and 
2q = Ly. Furthermore, the sensitivity increases with drive current 
because this increase narrows the filament, requiring a larger off-center 
displacement x, to bring xq into equality with x. For narrow filaments 
it is possible for the displacement to be many times larger than xz, 
resulting in a signal current greatly exceeding that of a Hall device of 
similar dimensions. As a practical matter, the short circuit signal cur- 
rent of devices typified by Fig. 2 will saturate at perhaps ten times 
that of a Hall device, because the sharpness of the profile eventually 
becomes diffusion-limited. However, this does not appear to be a 
fundamental limitation on device sensitivity, as is shown by the ex- 
ample at the end of Section IV. 


Ill. DERIVATION OF CURRENT PROFILE AND TERMINAL CHARACTERISTICS 

IN THE ABSENCE OF A MAGNETIC FIELD 

This section presents the calculation of the filament profile in the 
absence of a magnetic field, as well as the device terminal character- 
istics. It is shown that the shape of the filament can be characterized 
directly in terms of the device parameters in both the absence and 
presence of diffusion. We first consider, in Section 3.1, the highly 
idealized model introduced in the last section, and neglect diffusion 
as well. In Section 3.2 we take into account diffusion, which is the most 
important additional effect present in a real situation. 
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3.1 Fully regenerative solution 


Here we develop the mathematical solution relating the filament 
current profile to the structure parameters and the drive current I 7. 
The various voltages and currents entering the analysis are shown in 
Fig. 4, where it must be remembered that the two contacts on each 
base are shorted together as in Fig. 3. The procedure followed starts 
with a consideration of the lower base layer. Employing the continuity 
equation and the base resistance per unit length, 7, we derive a general 
relation between the hole current per unit length J,;(x) incident on 
base P», and the base voltage V,(x) developed with respect to the base 
contacts. From V,(a) and the terminal voltage V.z, we find the emitter- 
base voltage profile and, through the junction law, the injected return 
electron distribution Jn,(x). Vex is ultimately determined by the 
requirement that the total emitter current is J7. We can write a similar 
relation, for the upper base, between the incident electron profile 
Jni(x) and return hole profile J>,(x). In general, the complete set of 
self-consistent equations is then obtained by introducing the appro- 
priate connection between the incident and return profiles of each 
species. For a symmetrical structure and in the absence of diffusion, 
J n(x) can be directly equated to J,:(x). A single equation immediately 
results. 

The functional dependence of V,(x) on Jp:(v) can be written in 
the form 


ViGe= f “Bla, PV Ioi@a ae, (1) 


where the transfer impedance function Z(z, x’) is the voltage response 
at x to a 6-function of current incident at 2’. It is easy to verify that 
Z(x, x’) is given by 

Z(x, x’) 


‘ 


x 
q. (2) 


r(L — «)(L + 2’)/2L, a 
= r(L + 2)(L — z’)/2L, x 


IA IV 


The other equations required to complete the description of the 
lower emitter-base configuration are the voltage balance equation 


V (x) = Vo(x) + Vex (3) 
and the junction law 


Jnr(x) = J, exp [qV.(x)/kT'], (4) 


where the constant J, has dimensions of current per unit length. 
Equation (4) assumes large injection, i.e., net saturation current is 
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Fig. 4—Definition of variables. 


negligible. Although with filamentary conduction this cannot be the 
case everywhere along the junction, the errors involved are unim- 
portant when high-level injection is achieved in the vicinity of the 
device center. 

Combining Eqs (1) to (4) produces the relation between J,;(2x) 
and Ja,(x): 
Ineln (Fne()/Fe] = fb — aL + xd (ede a 


L ‘ ; j 1 Vg 
+ f b+ yb— xno de 55+ Gt, 6) 


where 
_ ke 


ie a qrL (6) 


is a structural, regenerative current constant and is the amount of 
current necessary to produce a voltage drop kT’ /q when flowing from 
base center to either base contact. Differentiating (5) yields 


Leg dS nr (2) 
Ine) dz 


= ap [e+ mera — [G2 Fnlorrae'], © 








the right-hand side of which may be identified as 1/L times the right- 
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ward flowing current I,.(x) in P». Accordingly, we rewrite (7) as 





a Lge —— a J ne(2)I p(x) /D, (8) 
where 
TAG a | a aN oo ie f ee w)J (ede | 
= ; a J oi(x')dx’ — if: J pi(x’) dz’ 
+ e [- 2'Jn(a')ae' | (9) 
Note that 
Fy(z) = So). (10) 


A single equation in one unknown is obtained by invoking the as- 
sumptions of symmetry and lack of diffusion: 


Jni(x) = Jp: (x) 
J nr(t) = J pr(Z) 
Flt) Pale) 
J p(t) = J pr(2) 


by symmetry, (11a) 
by diffusion = 0. (11b) 


Clearly, all currents are equal. In particular, Jn,(x) = J»p:(x), so that 
from (8) and (10) we obtain 


tig (x) 
dr 


dI v(x). 


dx 





= Tels = I,(x) (12) 
In (12) and thereafter we drop the superfluous subscripts; variable 
I,(x) still refers to the rightward flowing current in P,, and also gives 
the leftward flowing current in Ny. 

The nonlinear second-order differential Eq. (12) can be solved as 
follows. Rewriting (12) as 
@I,(x)  dlz(x) 


dx? dx 





age) OS 


and integrating from 0 to x yields 


dx dx 








2 21g] | = Ke) — GO). (13) 
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By symmetry about the center line of the structure, 


I,(0) = 0. (14) 
Upon introducing the maximum value of the current profile, 
te: _ al,(0) 
Jo = J (0) + dx’ (15) 
eq. (13) therefore becomes 
da, , 2 
dg Tn OTe ee 
Putting (16) into the form 
dl, dz (17) 


Od lig 1h. ODT veg? 


and integrating from —z to x results in 


1 Iy(2) 
eee es ere CES 
VOLT ol veg a (gar ) 
I4(—2) 22 
= =f —-_-* se ; 
ou ( VOLT ol reg )| iy 

Again, by symmetry about the center line, 

Iy(—2z) = — Ip(2). (19) 


Using (19) and the property that tanh—!(y) is an odd function of y, 
we obtain, after some rearrangement, 


——__— JoL x 
1G) = S0Ld Le tanh ( gee ). (20) 


I) = (21) 





From Fig. 4, 


Therefore, the current profile peak J, can be determined from the ex- 
ternally imposed drive current I; with the relation 


Ip = 2V2LS ol reg tanh (. [ret ). (22) 
reg 


Using this J, in (20) gives the functional dependence of the base 
current on position in terms of the drive current and known param- 
eters of the structure. In Fig. 5a, 22,(x)/Ir is plotted vs. x/L for 
various values of the dimensionless regeneration parameter I7/4I reg. 
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Fig. 5—(a) Position dependence of normalized base current for various values of 
regeneration parameters. (b) Filament current profile normalized to unity peak 
value for the same regeneration parameter values as in (a). 


For sufficiently large drive currents such that this parameter is much 
greater than unity, (22) reduces to 





Ip FS 2V2LS ol reg, (23) 
and hence 
Ir -_ qQrrL Job (24) 


Alege 4bT ~ NT ren 
Since the right-hand side of (24) is just the argument of the tanh func- 
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tion in (22), the regeneration parameter is properly approximated by 
(24) for contours of which the slope is small in the vicinity of x = L, 
i.e., the filament does not touch the boundaries. The contours are 
adequately described by 
pay 1 Ir x 
Ts(o) P25 Tr tanh (37-5), (25) 
which follows upon substituting (23) into (20). 
The filament profile itself is obtained simply by differentiating (20) 
or, for larger I7, (25). We find, respectively, 


dels 2 
= 2 oases 
J (x) = J, sechi (Jar ;) (26) 
= iz 2 Ip x 
Ja) = SI. sech (Gt): (27) 


Plots of J(x)/Jo vs. x/L for the values of I7/4I reg used in Fig. 5a are 
displayed in Fig. 5b. It is evident that, for large values of the regenera- 
tion parameter, highly localized current flow is obtained. Equation 
(24) shows that this parameter is made large through increase of I7, 
r, or L. However, Iregf is independent of L so that from (27) one sees 
that the absolute width of the filament is unchanged by variations of 
L for fixed r. For a given Ir the only way to sharpen the filament is to 
increase r. With a high degree of control, the current path is self- 
contained within an interior portion of the structure. Confined current 
flows without benefit of physical nonuniformity and is furthermore 
independent of overall dimension L. The parameter values necessary 
to produce a well-localized filament can be realized in a practical 
structure, as is demonstrated by the example in Section V. 

We now proceed to calculate the terminal characteristics. The most 
straightforward approach consists of relating J(L) to Ir with (26) 
and (22) and using the junction law (4) to relate J (LZ) to V.x. Recalling, 
however, that (4) applies only at high-level injection, which may not 
be satisfied at 2 = L, a more trustworthy method must be employed. 
Since (4) is reliable at « = 0, we may utilize it to find V.(0) from J,, 
and relate J, to Ir with (22). Then V.z is determined from (3) and 
(1), where (26) is used in the integral in (1). It is clear that, whatever 
the junction law, J(L) follows V.z, as impressed through the voltage 
balance described above, even if V.z is negative. Hence, with this 
method the errors in calculating V.z are no greater than those in ob- 
taining J, and V,(0) with the large injection assumption. When J, 
greatly exceeds the saturation current, these errors are small. 
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From (8) and (4), 
Ven = In Uo/J.) — V0), (28) 
where, in accordance with (1), 
: 
V,(0) = / ZO, 2) (eda. (29) 


Substitution of (2) and (26) into (29) yields 


0 
V,(0) = Jor [ P (L + 2’) sech? (, 4m. 5 Ja 
= reg 
1 L ; 9 Jol x F 
+5 Jor f (L — x’) sech (Jar 7 Jaa 
L ; SY ees : 
Jer i Geol (. ee - Jat 
kT J oL 
2 a In cosh (. Dees ). (30) 


Substitution into (28) results in 


Ver = a (Jo/Js) — arn cosh (Jor ) ; (31) 
reg 


where the definition (6) of I,., was used. From (31) 


Vice in| J / I. cosh? [Zee |: (32) 
reg 


Together with (22), (32) specifies the terminal characteristics. It is 
usually reliable only for J, > J, because of the large injection assump- 
tion. When the regeneration parameter is large, (23) may be introduced 
into (82), yielding V.z directly in terms of Ir. 


Vet = erin |r  / NOLS, Treg cosh (Zz )]: (33) 
reg 











II 











The terminal voltage Vr developed by current source J7, as in Fig. 
1c, is 


Vr = Vort Vouittin + 2V er 
AkT ie 
Vo + Visttin + In Ip / 2V2LT Tong cosh ( 7) |. (84) 


I 
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Fig. 6—Form of terminal I7 — Vr characteristic based on fully regenerative 
solution. 


For large argument the cosh function can be approximated by an 
exponential, permitting (34) to be rewritten as 


4kT Ir 
Vr = Vo + Vouit-in + ——1 | pees | - ee 35 
T built-in + q n Hal rLir (35) 


The Ir vs Vr characteristic is displayed in Fig. 6. It should be ob- 
served that the asymptotic negative resistance is essentially the same 
as in the structure of Fig. 1. This is evidence of the fact that, with large 
regeneration and a filament strongly confined to the center of the 
structure, regions of the base away from the filament have an effect 
indistinguishable from external series resistors. 


3.2 Diffusion limited solution 


While the regenerative solution of Section 3.2 may well be applicable 
to PNPN structures with a narrow central junction, we must take 
into account the diffusive spread of the carrier streams in the wide I 
region of a PNIPN magnetic field sensor. In the presence of diffusion, 
eqs. (8) and (9) are still valid, but the equality (11b) between the 
incident and return currents no longer holds. For example, the stream 
of holes Jp,(x) injected by emitter P, spreads under the action of diffu- 
sion while crossing the I region, to arrive at P, with a new broader 
profile J,;(z). An initially spike-like or Gaussian profile arrives as a 
Gaussian, and any other localized distribution also tends toward a 
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Gaussian. This relation can be expressed mathematically by 


L 
Tyi(t) = f G(e, 2d pola)’, (36) 
where G(z, x’) is the diffusion Green’s function® 
G(q, 2’) = 2 exp [-ab(e — 2’) (37) 
Vr | 
and ap is the diffusive spreading parameter. Here ap is given by 
ab = va/4D.W, (38) 


where W is the I region width and vg and Di, are the drift velocity and 
transverse diffusion coefficient of the carriers traversing it. It is, of 
course, assumed here that the diffusive spread is insufficient to cause 
the carrier stream to contact the boundaries at x = + L. 

Utilizing the symmetry relations (1la) and substituting (36) into 
(9), we obtain from (8) the equation in one unknown, J,(z), 


dJ,(c) 1 


x L 
— Tyee es amd) be (L + wae’ [ole 2 \ SI (2 )dx"’ 





= i: (L — x')d2’ ie G(2’, a Sele yaa” |, (39) 


where the species subscript has been dropped. In view of the com- 
plexity of (39), we attempt only an approximate solution. It is evident 
that such a solution would be most difficult in the parameter range for 
which the diffusive spread and regenerative filament width are com- 
parable. In the limit of small diffusion, which we shall not consider, 
perturbation theory could be used to find the slight modification pro- 
duced in the completely regenerative solution. At the other extreme, 
large diffusion, the regenerative mechanism is largely interrupted and 
the incident current profile tends toward a diffusion-controlled 
Gaussian. 

In the case of large diffusion, where the incident current profile is 
Gaussian, we may solve (39) approximately by also parameterizing 
J,(x) as a Gaussian, but with a different spreading parameter. This 
procedure can be justified in the following way. If we had a uniform 
incident current profile, the base voltage developed would be a para- 
bolic function of z. Then, with the assumed exponential junction law, 
the injected return current is fortuitously Gaussian. This return profile 
will remain Gaussian whatever the form of J;(z) in the regions external 
to J,(x), as long as J;(x) is reasonably uniform within the region of 
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J,(x). Therefore, in situations where the return profile is much nar- 
rower than the incident profile, J,(x) is always well approximated by 
a Gaussian. In the diffusion-controlled case, this narrow Gaussian 
return profile diffusively spreads into the broad Gaussian incident on 
the opposite base, thereby closing the self-consistent loop. 
We assume that 
J,(z) = J, exp (—a32?) (40) 


with a, the return profile spreading parameter. Then, after inserting 
(40) and (39) into (36), integration yields 





Ide) = ae i exp[ —a}(x — x’)? ] exp (—atx”)da’ (41) 


a . 
= — J,exp (—a%z’), 
ar 
where 


QOD 

ti Sears a 

is the spreading parameter of the incident Gaussian. In performing the 

integration, it has been assumed that apL >1 and a,L > 1, so the 

limits may be taken at infinity. We insert the form (41) for (36) 

into the bracket on the right-hand side of (39) and integrate again. 
The result is 


& L x 
/ (L + 2S .(2’)de’ — / (L — 2')Ji(e\de’ = 2L I JFila"\der' 
L x 0 





= ve J,ert (aiv). (48) 
Substitution of (40) and (48) into (89) yields 
AT p08 = aL J, erf (a;2), (44) 


which clearly cannot be satisfied at all « for any spreading parameter 
values. The necessary approximation consists of replacing the error 
function by its first-order power series expansion term valid for small 
ax. We obtain 





21 go = a = fe QL,Qto/ Vo + ap. (45) 
Using the normalization of (40), 
Ira, 
Jo = : 46) 
Wr 
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(45) becomes 
2 __ (rap/2LI reg)? a 





at + ada? ace 0, (47) 
of which the meaningful root is 
Trap ( apLl re } apLl p. 
2 rep | fi + s ) — Spars |. 48 
eT ee Velr ) Vale = 


For (48) to be accurate requires that the return distribution fall to a 
negligible amplitude at values of x such that the next expansion term 
in erf (a,x) beyond the first makes an insignificant contribution in 
(44). Thus, setting + = 1/a,, for which 


«(2)-2(0-()/e] 


the criterion is easily seen to be 


a; \? 
( =) K 3. (50) 
This is not really very stringent, because it indicates about 3 percent 
accuracy when the incident distribution is only three times wider than 
the return distribution. A simpler expression for a? than (48) may be 
obtained when the inequality (50) is well satisfied. We may estimate 
the magnitude of the dimensionless ratio apLJ;eg./VaIr by applying 
(50) to (48), together with the relation ap ~ a;, which follows from 
(42) and (50) and is used to eliminate a;. Neglecting the departure 
from unity of the bracketed expression in (48), we see that there 
results the condition 





apLl reg 3 
<= ol 
Val p 2 ey 
Therefore, in the diffusion-controlled regime, a? is well approximated by 
Ira 
2 TAD 
a; AS 52 
2V tL reg 7) 


Surprisingly, the Gaussian parameterization of J,(x) yields a solu- 
tion which, in the absence of diffusion [ap — © in (47) ], departs only 
moderately from the fully regenerative solution (27). Figure 7 com- 
pares these solutions for the same value of J 7 and shows that the Gaus- 
sian approximation overestimates the peak amplitude by 28 percent 
and is correspondingly narrower. Although the Gaussian therefore only 
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——— 1.28 Exp (—1.275y?) 


—_ 


I Sech 2(y) 








Fig. 7—Comparison of Gaussian approximation in the diffusionless case with fully 
regenerative solution at one value of Ir. 


approximately represents the true solution, it demonstrates that we 
can carry the large diffusion approximation well outside its intended 
range of validity without a precipitous drop in accuracy. 

The terminal characteristics in the diffusion-controlled case may be 
found with the same procedure employed for the fully regenerative 
solution. As long as the diffusion-controlled filament remains narrow 
compared to 2L, the asymptotic negative resistance is reduced by the 
factor (1 — 1/Vza:L) which is close to unity. 


IV. RESPONSE TO A MAGNETIC FIELD 


Section II gave a qualitative explanation of the magnetic response 
of the PNIPN structure. It was shown that unequal leftward and right- 
ward base currents resulted. Here we calculate this current unbalance 
in the limit of linear response. We define the signal current Js as the 
increase in current flowing out of the right-hand contact of base Po. 
Small signal calculation of Is is simplified because it presupposes that 
the magnetic driving force is negligibly perturbed by the magnetically 
produced changes in current profile. Thus, the terminal response is 
obtained by perturbation theory without a recalculation of the fila- 
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ment shape. Again, we neglect and consider the effect of diffusion in 
Sections 4.1 and 4.2, respectively. 


4.1 Fully regenerative case 


With the magnetic field directed into the page in Fig. 2, both the 
downward flowing holes and upward flowing electrons are deflected to 
the right by the Hall displacement 


tH = wBW, (53) 


which is the same for both carrier species, assuming equal mobilities. 
As a consequence of this deflection, relations (11b) become 


dale + Ly) = J nigh ®) 


J pi(t + 2) = J pr(2), (54) 


which is applicable as long as the current profiles do not contact the 
boundaries. The symmetry of the structure preserves relations (11a), 
which, together with (54), yield from (8) and (9) 


Treg 5 a eee a Ji(a + xn) | i (L + 2’) Si(x")de’ 
a if Tapes v')J(u'yae' |, (55) 


where we have dropped the species subscript. By changing variables, 
(55) can be rewritten 


tae J (x2) = sad (x) ce (L + 2’) Ji (x')d2’ 
7 l. Me w)Ji(e'\de'| 


= spo | [7 b+ xed 


is (L ~ x')Ji(x’)dx’ + QL [- "si(e')de' (56) 


We recognize from (9) that 1/2L times the first two terms in the last 
bracket is J,(x). The last term, furthermore, can be written to first 
order in the magnetic field as 


aL / "Tada! x — QLagJ.(2). (57) 
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Therefore, (56) becomes 


= Troe Sale) = F Jaleo) — FC, (58) 


which, by (10), can be written 


Ey eee Ce  13(a) — 2 34x). (59) 


ree lee x dx 


Integration of (59) from —L to L, together with the vanishing of 
Ji( +2), yields 


L 
ALL) — B(-D)] = an [| SA@)ae. (60) 
—L 
From the definition of Is 
I 
I,(L) = 5 +Is 


I(-L) = — 32+ Is. (61) 
Therefore, to first order in Is, (60) becomes 
ji= uf A@)de/T. (62) 


Since the right-hand side of (62) is by virtue of xy already linear in the 
magnetic field, the unperturbed filament profile may be used for J; (zx). 
We can see from this equation that Is/Ir will increase for fixed ty 
when the filament profile J;(z) is made sharper. Evaluation for the 
fully regenerative profile (27) results in 

Is LH Ir 


Ge ~ 121 a a ee) 





Substitution for J... from (6) and for zy from (53) gives 


Is _ pBW fa ) 

Ile QL \kT 6 

~ BBW ee) 

gee) iF ( 3kT /]’ (64) 


where »BW/2L is the short circuit current ratio of an ideal Hall 
device of similar dimensions and qV,(0)/3kT is a convenient measure 
of the enhancement of the sensitivity with regeneration. V»(0) 
&rLI7/2 is the center-to-edge base voltage in the absence of the 
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magnetic field and can be on the order of volts, leading to enhancement 
factors in the range 10 to 100. 


4.2 Diffusion limited case 


Putting (9) into (8) and using only the symmetry relation (11a) 
we have, upon dropping the species subscripts, 


Inne Se Tela) = Jolt) gpa | [7 G+ vee yee 
_ iz (L — w)Ja(e')ae' |. (65) 


In contrast with the procedure followed in Section 4.1, it is convenient 
here to integrate (65) from —LZ to +L at once, to obtain 


0 = ie J,(x)d / (L + 2S, (2") de’ 
2 i | ROE J “(L = 2')\di(a')de’. (66) 


Again, we have assumed the vanishing of the filament profile at the 
boundaries, i.e., J-(4+L) = 0. Upon introducing I.(x) defined by 


I(g) = | (67) 


integration by parts of (66) yields 
0 = [I.(Z) + I.(—L)]LIr + U.(L) — 1.(-L)] 
L L 
x / ad (a)dx — 2L il I.(x)Ji(x)dx. (68) 
ay i —L 
From the second form of (9) and from (61) 
2 L 
: 7 ead) eae) 
= 2s. (69) 
In analogy with (61), we define Is by 
I.(+L) = ae 5 t Is. (70) 
I, is a construct which can be interpreted as the lateral emitter cur- 


rent if the emitter, like the base, had contacts at +L. I's is the mag- 
netically produced unbalance in J,. Substitution of (69) and (70) into 
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(68) results in 
L 
0 = Inl's + Ils — [ T(x) J s(x)de. (71) 
aL 


This equation is merely a simplified version of the integral of (65). 

To proceed further, it is necessary to introduce explicitly the simul- 
taneous diffusive spreading and lateral magnetic displacement of the 
carrier stream as it crosses the intrinsic region. Combining (36) and 
(54) leads to the general relation between J; and J,, 


J;(x) = te G(x, «')J,(2" — ty) dz’, (72) 


where G(z, x’) is the diffusion Green’s function (37). Expanding (72) 
to first order in xy yields 


3 L 
J ;(x) wef G(a, x’) J ,(x’)dx’ — on f G(a, 2’) é. J ,(x')dx’ 
L ‘ , ta e d , , ‘ 

= ie G(a, x’) J,(2’)dz’ — va [| + G(x, 2')J,(x2’)dz’, (73) 


where the second form has been obtained through an integration by 
parts with the boundary condition J(+L) = 0, and the relation 


dG/dx’ = — dG/dx. Upon substituting (73) into the integral in (71), 
the first term of (73) gives rise to an integral of the form 
L L 
=f I.(a)de G(x, 2')J,(2')de’. (74) 
sf 27 


It is possible to show by successive integration by parts that 
L L 
= 1(L) [ 1(e)@(e, dx ~ 1.(-L) [ Lo(@)@(@, Ld. (75) 
=h al 


The vanishing of J, in the vicinity of the boundaries corresponds to a 
nearly constant value of J.(x) in the boundary regions where G(x, +L) 
has a significant magnitude. By noting the normalization 


L 
/ G(c, +L)de = 1, (76) 
-—L 2 
we obtain 
§ = 37(L) — e(-L)] (77) 
Therefore, substitution of (73) into (71) eliminates the Js term, leaving 
L L 
Ils = — ru | Ie(a)de f ag. x’) J ,(”')dz’. (78) 
—L —L dx 
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Interchanging the order of integration, integrating by parts with 
respect to x, and utilizing (67) give 


Tales tal 1e(D) ie G(L, 2’)J,(a")de’ 
= 1(-1) [- G(—L, 2')J-(2")dz' 


= ie i : J (x)Q (a, x')I (x')de aa’ |; (79) 


Because J,(x) and G(+1, x) do not overlap, the first two integrals in 
(79) vanish, yielding the final result 


Is _ L orl ies 
i= tn ts hs J,(a)@(a, 2"\J,(a’)de da’ /T2. (80) 


In the limit of no diffusion G(az, x’) — 6(a — 2’) and (80) reduces to 
expression (62), but (80) is valid for arbitrary diffusive spreading. 
For J,(x) parameterized as a Gaussian according to (40) and using 
(37) and the normalization (46), (80) becomes 


7, = tH 207 LT reg/IrV1 + 2a5/ai. (81) 
T 

In the diffusion-controlled regime characterized by a? as given in (52), 
the radical in (81) is approximated by unity, and we find 


ee 
Ir 
= tyV0a/4eD.W. (82) 


The result (82) can also be obtained from (80) by letting J,(1%) > I76(x) 
for which 

7 = r7G(0, 0) = tyap/Vr. (83) 
The equality of (82) and (83) demonstrates that, in the diffusion- 
controlled regime in which a;/a, need only satisfy (50), the structure 
nevertheless responds to a magnetic field as if the return current 
profile were a very sharp spike. The absence of Ir on the right-hand 
side of (82) indicates that diffusion saturates the magnetic sensitivity 
and, unlike (63), the signal is now only linearly proportional to the 
drive current J7. To compare the diffusion-controlled detector with a 
Hall effect device, we substitute for zy from (53), define the voltage 
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across the J region at the center by 
Vs = V.— 2V,(0), (84) 


and introduce the transverse noise temperature of the carriers defined 
by the Einstein relation 








kT, = qD./p. (85) 
Thus (82) becomes 
Ts ~ uBW L qVB ): 
Ip 2L € wkT n (86) 


The expression in parentheses is the sensitivity enhancement factor 
for this case, which should be compared with (64), derived in the ab- 
sence of diffusion. Equation (86) shows that the sensitivity of the 
diffusion-controlled detector is improved by increasing the central 
bias voltage until carrier heating predominates. At 8 V, the radical has 
a value of approximately 10 for W sufficiently large that T, ~ T. 

Equation (86) seems to suggest that large sensitivity enhancement 
with respect to Hall devices can be achieved by making L/W very 
large. This improvement is, however, illusory because it merely creates 
an unfavorable geometry for the Hall device. A fair comparison is 
possible when the device configurations are nearly square. Although 
in this case an enhancement factor involving only qV 2/kT’, is indicated, 
this should not be construed as an ultimate limitation imposed by 
diffusion, but rather as a structural limitation. The following example 
will illustrate how, for fixed W and L, the fully regenerative enhance- 
ment factor can be obtained within the constraints imposed by diffu- 
sion. An analysis has been carried out for a structure in which the 
emitters are contacted at +2 and have resistances per unit length 
approaching but less than that of the base layers. It has been found 
that, in the absence of diffusion, emitter resistance broadens the 
filament but does not diminish its off-center displacement or signal 
current when a magnetic field is applied. Since a broader filament is 
less subject to diffusive spreading when diffusion is taken into account, 
the effect of sufficient emitter resistance is to carry the filament forma- 
tion and magnetic response out of the diffusion-controlled regime 
back into the fully regenerative regime. Therefore, the diffusion limit 
given by (86) would appear to be appropriate only to the structure 
analyzed in detail, rather than to be fundamental. 


V. PRACTICAL MAGNETIC DETECTORS 


The previous sections of this paper have established the fundamental 
principles according to which controlled filaments might be produced 
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in PNIPN structures and have analyzed their magnetic sensitivity. 
Certain idealizations were made in order to develop a coherent theory. 
One purpose of this section is to give at least a preliminary account of 
the effect of removing these idealizations, so that we may relate the 
theory to practical devices. Since magnetic response has heretofore 
been characterized solely in terms of the short circuit signal current J, 
it is also necessary to analyze the behavior of the magnetic detector in 
an actual circuit which presents a finite impedance to the detector 
output. Several realizable circuits are considered. Finally, practical 
design parameters of a particular detector are given and performance 
predictions are made. Because filament formation in these devices 
requires that they be biased into the negative resistance range, there 
may be a tendency for ac instability, notwithstanding their apparent 
stability at de. The dependence of oscillatory behavior on parasitics 
suggests that, at the outset, only experimental resolution of the stability 
question is feasible. 


5.1 Removal of idealizations 


The model developed thus far has been based on the explicit assump- 
tions of (1) complete structural and electrical symmetry, (2) high 
level injection, (3) infinite current gain, and (4) lateral carrier stream 
spreading in the I region by diffusion only. It has also been implicit 
in the analysis that it is permissible to neglect the effects of lateral 
electric fields in the I region, filament position pinning resulting from 
structural imperfections, and possible modulation of base width and 
conductivity. While a detailed investigation of all these effects is 
beyond the scope of this paper, we shall explain why they are not apt 
to modify greatly the operation described in the previous sections. 

In view of the regenerative nature of the filament, the assumption of 
infinite current gain might appear questionable. In actual fact, it is 
easily shown that for finite, but reasonably large, values of common 
emitter current gain 8, device performance is only slightly degraded. 
We consider first the fully regenerative case, i.e., no diffusion. In the 
absence of a magnetic field we recall from (11a) and (11b) that J;(z) 
= J,(x). When 6 — ~, the base current, and hence the base voltage 
Vio(x), are produced entirely by J;:(x) as given by (9) and (1), re- 
spectively. For finite 8, there is an additional base current component 
produced similarly by a current profile J,(x)/6(=J;(x)/8) which is 
subtractive, and hence reduces the base voltage drop to V.(zx) 
= (1 — 1/8)V..(x). This voltage reduction is the same as would be 
caused by retaining infinite 6 and reducing r from the original value 
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To to 


iS (1-3) r. (87) 


Assuming now that an increase in the actual base resistance is made to 
compensate for this effect, no modification results in the filament pro- 
file if the current through the battery is maintained unchanged. To do 
so with finite 8 requires an increase in emitter current by a factor 
(6 + 1)/(6 — 1). It is clear that the filament disappears for 8 < 1, 
but that for 8 > 1 there need only be a small degradation. 

In the diffusion-controlled regime there can be additional significant 
effects of finite 8. When the incident profile is much broader than the 
return profile, we have J,(0)/J;(0) = a,/a; > 1. Therefore, in the 
vicinity of the origin, the injection process will give rise to subtractive 
base current components comparable to those produced by J;(x), un- 
less 8 is sufficiently larger than a,/a;. The presence of such subtractive 
components lowers the base voltage at the origin, broadening the re- 
turn profile and self-consistently lowering J,(0) until 6 > J,(0)/J;(0) 
is suitably satisfied. Clearly, in the diffusion-controlled regime, finite 
current gain places a limit on the sharpness of the return profile which 
cannot be improved by increase of base resistance, 1.e., a, < Bap if the 
approximation of a Gaussian return profile is retained. Because the 
magnetic sensitivity is only weakly dependent on the return profile 
width if (50) is satisfied, as shown by the comparison of (82) and (83), 
it should only be slightly affected by finite current gain as long as 
B > vB. 

We now briefly consider several effects that can modify filament 
formation and translation through localized departure from the simple 
theory. Lateral fields in the I region, brought about by the base layer 
voltage, can cause deflection? of the carrier streams not taken into 
account in the filament analysis. As a result of the symmetry of the 
two emitter-base configurations, there is electrical symmetry about 
the plane midway between the bases. Therefore, the electric field 
streamlines in the I region may converge near midplane, but still 
connect, in 1-to-1 fashion, points on the two base layers lying equi- 
distant from filament center. Consequently, although the filament may 
tend to neck in at the center, this effect will not by itself give rise to 
additional lateral spreading. Similarly, when the filament is displaced 
off-center by a magnetic field, these lateral fields will not cause a net 
restoring force toward device center. 

Filamentary instabilities characteristically occur at the particular 
cross-sectional location where breakdown is most easily initiated. We 
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have shown that in the present controlled filament formation mecha- 
nism, nucleation takes place at the center of the structure. It is still 
possible, however, that at other locations pinning points may exist for 
the filament because of structural inhomogeneities such as, for example, 
a locally enhanced injection efficiency. It is convenient to classify 
such inhomogeneities according to their size relative to the filament 
width. Large-scale inhomogeneities, which we shall assume to be 
reasonably weak, should result in only mild distortion of filament shape 
and position. In using the structure as a magnetic field sensor, this 
effect would produce a de “‘offset voltage,’ but not otherwise interfere 
with the magnetic response. On the other hand, intense small-scale 
parameter fluctuations would provide distinct filament pinning points. 
However, in the diffusion-controlled regime this effect should be much 
reduced. Not only does the diffusion introduce an averaging over 
dimensions larger than the inhomogeneity, but the accompanying 
interruption of the feedback loop serves to damp down the multipass 
gain fluctuations. Because of the filament centering force inherent 
in the simple theory, pinning the filament becomes progressively more 
difficult at points away from device center. Ultimately, however, 
the importance of filament pinning will have to be determined 
experimentally. 

In contrast with structurally associated departures from ideal be- 
havior, localized parameter variations may occur self-consistently in- 
duced by the filament itself. Under conditions of high current density, 
transport in the base may be modified by increased base width or 
conductivity. It is well known that for transistors operated at high 
currents the base tends to widen. A similar effect here would lead to 
a decrease in the base resistance per unit length r. When there is a 
perfectly compensated filament of electrons and holes in the collector, 
however, one would expect this effect to disappear but, if there is 
diffusive spread of carrier streams, locally perfect compensation is 
absent and some base widening may still occur. A similar local de- 
crease in r would result directly from the conductivity modulation 
produced by the injected carriers. This effect is readily minimized by 
making the base layer thin, while keeping the same sheet resistance. 
With a thinner base the minority carrier density for a given current is 
lower, while majority carrier concentration is higher. In any event, a 
local reduction in r will broaden the filament, but one would expect: 
the change in shape to be more pronounced than the actual change in 
width. Similar modification of the filament profile can be anticipated 
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from the falloff of injection efficiency at extremely high injection 
levels. 

We now examine the assumptions of structural and electrical 
symmetry. Structural asymmetries, an example of which is an in- 
equality of base resistance, is subject to technological control and can 
probably be made small. Such asymmetry will invalidate (11a), re- 
sulting in inequivalent electron and hole profiles, but if reasonably 
small it is unlikely to affect the average filament properties or magnetic 
response. In contrast, the electrical asymmetry is mostly governed by 
the disparity of the electron and hole mobilities which is not con- 
trollable and may be quite large. An immediate and important con- 
sequence of such a mobility ratio is inequality of the electron and hole 
Hall displacements. It might appear that, because of this inequality, 
a magnetic field would disrupt the filament by pulling apart the elec- 
tron and hole streams. Indeed, it has been proposed that the magnetic 
response of a GaAs double injection diode can be explained by such a 
mechanism.® In the present system, this phenomenon may occur at 
very high magnetic fields but should normally be avoidable, since the 
filament is broader than the single-pass Hall displacement and there 
is no strongly nonlinear pinning point. We have made an analysis based 
on a rigid displacement of the electron and hole current profiles in 
the fully regenerative case which indicates that no strong disruption 
is to be expected. The results show that the coordinate difference 
between centroids of the return distributions is just one-half the dif- 
ference between their Hall displacements and is therefore much less 
than the off-center displacement. A quantitative measure of the un- 
balance can be obtained from the ratio of the unbalance of the signal 
currents in the two base layers to their average: 


Ign — Isp Sl reg tas — fae | 
Te = Ir ( Lay ) (88) 





where Js, and Is, are the signal currents in Ny and Po, tyn and yp 
are the Hall displacements of electrons and holes, and Js and xy are 
the average signal current and Hall displacement. The factor I7/3Ireg 
is recognized from (64) as twice the enhancement factor, and the right- 
hand side of (88) is therefore much less than unity. 

Another effect of the mobility ratio is the destruction of the inherent 
filament space-charge neutrality, with the result that there will be 
increased lateral space-charge spreading. Qualitatively, the effects of 
space-charge spreading are not greatly different from those of diffusion 
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and therefore the simple diffusion theory should account for its main 
features. Since, unlike diffusion, space-charge repulsion scales with 
filament current, it can be minimized by increasing the base resistances 
so that the necessary base voltage drops can be achieved at low current. 
Another approach is to use a circuit that equalizes the carrier densities 
by equating the electron-to-hole emitter current ratio to the mobility 
ratio, thereby restoring a nearly neutral filament. 


5.2 Magnetic-detector circuit connections 


Up to this point, the response to a magnetic field has been character- 
ized only in terms of a signal current Js. Here we consider the inter- 
connection of the detector with a finite load impedance. In the circuit 
of Fig. 2, Is could have been detected only by a perfect ammeter. 
Figure 8 shows a straightforward circuit modification which provides 
terminals for the connection of load resistors, Rz. In the absence of 
magnetic field, the voltage and current of the six device terminals, 
and therefore the filament profile, are completely unaltered by the 
addition of the external resistors R.x, provided battery Voo has the 
value 

Vo0 = Vo + delves: (89) 


The magnetic response is most easily understood by adopting an 
alternative view, in which resistors R.-x are considered part of extended 
base layers having total effective resistance 2Rer = 2Rex + 2rL. If 
the filament remains sufficiently confined to fall well within the actual 
device boundaries, the whole configuration behaves as if it has a base 
of effective length 2L.¢¢ related to Rers by 2rLers = 2Rett, so that 


R 


ex 
r 





Lets = L + (90) 
When the load terminals are open circuited, 1.e., Rz > «, the signal 
current for both the fully regenerative and diffusion-controlled case, 
given by (63) and (86) respectively, are unchanged by the change from 
L to Ler. In (63) the product LI rez, and hence Is, is independent of 
L by (6), while in (86) Js is explicitly independent of L as long as 
battery V oo has been increased in accordance with (89). The open circuit 
voltage Vo is therefore 

Vio = 2Rexl s (91) 


and has the polarity given in Fig. 8(a). When the terminals are short 
circuited (R;, = 0), Js is still that given by (63) or (86) and now flows 
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Fig. 8—(a) Magnetic field detector circuit with provision for load Rx. (b) Load 
line for (a). 


completely through the short as Io. Within the small signal approxima- 
tion, the device is linear and we obtain the load line given in Fig. 8(b). 
It is an interesting feature that the output impedance, 2R.x, is given 
solely by the magnitude of external resistors. The apparent ability to 
obtain an indefinite increase in open circuit voltage, by increase of R-x, 
is just a reflection of the fact that battery Voo is correspondingly in- 
creased in accordance with (89). It is worth noting that, although Js 
has the same value in both the open and short circuited conditions, 
the off-center displacement of the filament, x,, is unequal in the ratio 
Ler:/L, reflecting the stronger centering force in the case of the short 
circuit. 

A problem encountered with all magnetic detectors is that structural 
nonuniformities result in “offset voltages.” If the present structure had 
only a single base layer, the filament would locate itself at the elec- 
trical center and there would be no offset voltage. It is expected that 
in the actual structure the electrical centers of the two base layers 
will not exactly coincide so that the filament will seek an intermediate 
position. The result will be an offset voltage for each base layer. It 
should be clear that the position of this new electrical center is de- 
termined only by structural imperfections and will therefore not depend 
on the enhancement factor. Consequently, the ratio of signal-to-offset 
voltage for this device should exceed that for the equivalent Hall effect 


PNIPN CONTROLLED CURRENT FILAMENTS 497 





Fig. 9—Signal-summing offset-nulling magnetic detector circuit. 


device by this enhancement factor. Furthermore, it is possible to 
envision a circuit connection, as shown in Fig. 9, in which the signal 
currents of the two base layers are additive, while their offset voltages 
are cancelled at least to first order. This circuit has a cross connection 
of the two base layers by means of two batteries V.. It also has the 
interesting feature of displaying terminal characteristics of a nearly 
ideal magnetically controlled current source 2/ s. 


5.3 Sample device parameters 


Figure 10 shows a realizable configuration of the magnetic detector. 
It is a planar structure formed on a nearly intrinsic substrate. The 
largest areas are base layers N; and P». Application of reverse bias V, 
between N, and P, depletes the substrate in the intervening region. 
Heavily doped emitters N. and P, are shaped to be completely on 
top of the base layers. This structure, with the dimensions shown, can 
readily be fabricated with current technology and therefore constitutes 
a reasonable choice for initial experiments. It is also assumed that a 
base sheet resistance of 10 kQ/O is attainable. With these con- 
straints the structure is far from optimum, but the performance 
characteristics shown below nevertheless compare favorably with 
other magnetometers. 

With a base sheet resistance of 10 4Q/O and a base width of 12.5 wm, 
we find a resistance per unit length r = 800 Q/um. Equation (6), with 
L = 100 um, then yields J... = 0.312 vA. For a drive current I7 = 10 
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pA, the filament profile in the fully regenerative case is found from 
(27) to be 
J (x) = 0.4 sech? (s _ ) pA/um. (92) 
Referring to Fig. 7, the half amplitude points fall at z, = + 0.85L/8 
~ +11 um, so that the filament is indeed much narrower than the 
length of the base. Using (80) the corresponding voltage from base 
center to edge, V,(0), is 0.866 V. This result may be compared with 
the value 0.4 V obtained by assuming a perfectly sharp profile for 
which [7/2 flows through a resistance rl, and indicates that the 
finite filament width gives rise to a less than 10 percent voltage 
reduction. 
Two considerations enter the choice of the battery voltage V,. First, 
it must be sufficient to fully deplete the substrate material between P, 
and N,. Assuming a bulk resistivity of 5 kQ-cm or better after the 
necessary processing steps, 5 V would be enough to deplete a plane 
parallel structure 50 um across. Allowing for some extra width necessi- 
tated by the plane configuration and some margin for being well 
swept out, a voltage V, = 11 V, corresponding to a drop of ~ 10 V at 
xz = 0, should be just adequate. The second consideration is the diffu- 
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Fig. 10—TIllustrative example of realizable magnetic detector. 
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sive spread. At x = 0, the average field in the I region will therefore 
be in the neighborhood of 2 kV/cm. This field is insufficient to greatly 
heat the carriers, so that it is justified in (88) to use D: expressed by 
(85), with kT,./¢~kT/q = 0.025 eV. The resulting value of ap is 
0.2 wm, for which the half-amplitude half-width of the Green’s 
function (37) is 4.16 um. This value is small compared with the value 
XL» = +11 um for the filament and works out to an additional spread 
of only about 12 percent. It is therefore proper to use the fully regenera- 
tive solution to calculate the magnetic response. 

The regenerative enhancement factor defined in (64) for the above 
parameters works out to a value of 5. This value only specifies the 
enhancement of the short circuit signal current Js over that of an 
equivalent Hall device. The full available output voltage when the 
device is used in the circuit of Fig. 8, however, still depends, by (91), 
on the choice of R.x. Choosing R-x arbitrarily to be 1 MQ, adjusting 
Voo according to (89), and using (63) and (91) leads to 


Vio = 22BI pr volts, 


which corresponds to a figure of merit of 22 V/GA. This figure of 
merit is of the same order of magnitude as that reported for other 
sensitive magnetometers.!! It is expected that considerable improve- 
ment can result from proper design. 


Vi. SUMMARY 


We have shown that spreading resistance in the base layers of a 
stripe geometry PNIPN structure, with cross section and circuit as 
shown in Fig. 2, leads to a localized current density profile, ie., a 
filament. If lateral spread of the carrier streams in the I region can be 
neglected, the current density profile is adequately represented by 
eq. (27). A plot of this function appears in Fig. 5b, which shows that 
as the drive current Ir is increased, a sharpening of the filament occurs. 
The relevant parameter is the ratio of Ir to Ireg, where I reg, defined in 
(6), is the amount of base current that would have to flow from device 
center to a base contact to produce a voltage drop kT'/q. The numerical 
example given in Section V shows that for a realizable structure a 
- typical value of Treg is ~ 0.3 wA, so that for Ir ~ 10 pA a highly con- 
fined filament is obtained. When carrier transport in the I region is 
characterized by significant lateral diffusion, the ultimate sharpness 
of the filament becomes limited. For sufficiently large Ir it becomes a 
good approximation to represent both the return and incident current 
density profiles by Gaussians: (40) and (41), respectively. Although, 
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as shown by (52), the return profile Gaussian continues to narrow with 
increasing Ir, the incident profile saturates to a width determined 
solely by diffusion, i.e., for a,—> © we have a;—ap, where ap is 
given by (88). Using at filament center vg = nH = uVz/W and the 
definition (85) of the transverse noise temperature, we find a} = 
qV 2/4kT,,W?. Therefore, the width of the diffusion controlled filament 
is independent of parameters characterizing the lateral extent of the 
structure. 

The small signal linear analysis of the magnetic response of the 
PNIPN structure suggests that it may be regarded as a magnetically 
controlled current source. The principal result of the paper, eq. (62), 
relates the magnitude of the magnetic signal current Js to the drive 
current I, the single-pass Hall deflection x,,, and the incident current 
density profile in the absence of diffusion. Noting that I7/2L repre- 
sents the average current density (J;(x)) and that for any nonuniform 
function (J2(x)) > (J;(x))?, we see from (62) that Is/Ir will always 
be larger than x,/2L, with the inequality increasing for progressively 
sharper filaments. Since X 7/2L is just the ratio of short circuit signal 
current to drive current for an ideal Hall detector of dimensions W 
and 2L, a clear advantage is indicated. A convenient measure of the 
enhancement is given by the factor qV.(0)/3kT in (64), where V,(0) 
is the center-to-edge base voltage in the absence of the magnetic field. 
This factor can be in the range 10 to 100. When lateral diffusion in the 
I region is important, (80) must be used in place of (62). Equation 
(80) involves the return profile J,(x) because J;(x) is explicitly re- 
lated to J,(x) by the diffusion Green’s function. The sensitivity en- 
hancement still depends on the sharpness of the current density 
profile, but now, as shown by (83), an infinitely sharp return profile 
J,(z) leads to only a finite enhancement factor, given in (86) in terms 
of the fundamental parameters. Depending on the device geometry, the 
enhancement factor can again be of order 10 or more. The parameters 
which enter it are those pertinent to the diffusion-controlled filament 
and do not include r or Jr. Although this limiting behavior follows 
directly from the assumption of an infinitely sharp return profile, 
the derivation of (82) and subsequent discussion makes clear that it 
is also descriptive of the sensitivity when the return current profile 
is only moderately sharper than the diffusion-broadened incident pro- 
file. Because, within the limits set forth in Section V, the PNIPN 
magnetic detector behaves as a magnetically controlled current 
source, its useful output voltage is determined solely by the circuit 
in which it is imbedded. For the circuit of Fig. 8, the device considered 
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in the numerical calculation should have a sensitivity of 22 V/GA 
when driven at J7 = 10pA. 

An important feature of the PNIPN structure is the possible reduc- 
tion of the offset level which is so troublesome in magnetic sensors. 
There are various ways in which this reduction can be effected. Most 
directly, the offset current, being of geometric origin, is not subject 
to the enhancement factor experienced by the signal current, and 
the signal-to-offset ratio is correspondingly improved. Furthermore, 
the addition of matched external resistors, as in Fig. 8, permits ex- 
ternal control of the offset because such resistors act as extensions of 
the base layers, increasing the effective length of the device and thereby 
making a percentage improvement in the tolerance. A quite different 
approach to offset reduction is represented by the circuit of Fig. 9, 
in which the device incidentally appears to function as a magnetic 
current source. Analysis indicates that in this circuit configuration the 
signal currents in the base layers will be summed in Rz, while the offset 
currents will be nulled to first order, i.e., to the extent that they are 
of the same magnitude in each base layer. While the circuit of Fig. 9 
may not itself turn out to be practical, it illustrates that the device 
can provide enough output information to make at least a first-order 
distinction between the signal and offset. 
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for Decision Feedback Equalization 
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We present the constructive design of finite order equalizer filters for 
data transmission systems employing decision feedback equalization. 
Both transmitter design with power constraints and receiver design with 
ambient noise considerations are treated. Expressions for the filter tap 
settings which maximize a signal-to-noise ratio are found for both baseband 
pulse amplitude modulation and quadrature amplitude modulation 
(QAM) systems. Design examples are given in a passband equivalent 
(of QAM) formulation for an average toll telephone connection. Neglecting 
the possibility of error propagation, these examples demonstrate that 
decision feedback equalization requires fewer taps for acceptable system 
performance as compared to linear equalization. The problem of post- 
cursor size in a decision feedback equalized response is treated and shown 
to diminish in importance when a hybrid equalization procedure ts 
imposed on the linear tap adjustment. The price one pays for allowing the 
linear filter taps to reduce the postcursor sizes in this hybrid equalizer is a 
lower signal-to-noise ratio. 


1. INTRODUCTION 


The advantage of using a nonlinear device, referred to as a decision 
feedback equalizer, to cancel the tails of pulses whose amplitudes have 
already been estimated in a PAM system has long been recognized. 
Figure 1 depicts the typical system in which the decision feedback 
mechanism has always been envisioned to perform this task. Namely, 
by making decisions on a symbol-by-symbol basis and by knowing 
the channel response precisely, a data system would be designed so 
that postcursor (tails of preceding pulses) ISI could be eliminated 
without the ambient noise penalty that a linear filter or equalizer 
imposes. The tacit assumption being made in any decision feedback 
implementation is that the signal-to-noise ratio is high without 
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Fig. 1—Transmitter and receiver filter design. 


equalization, and correct decisions are already being made with high 
probability. 

In this paper we consider the design of finite order nonrecursive 
transmitting and receiving filters which counteract the two remaining 
sources of noise, the precursor tails which are the interfering samples 
of pulses whose amplitudes have not been decided upon, and ambient 
noise, everpresent in a communication system. In addition we seek 
the variation of the signal-to-noise ratio at the moment of decision 
when the sampling time is varied. Along with this variation of the 
criterion of system performance, we are also interested at each sampling 
time in the amount of postcursor ISI noise which the decision feedback 
mechanism is being asked to eliminate. This aspect of our investigation 
yields insight into the feasibility of decision feedback system implemen- 
tation. It is, of course, possible to design the transmitting and receiving 
filters to achieve “hybrid’’ equalization between simple linear equali- 
zation and decision feedback. That is, some of the linear filter’s 
degrees of freedom will be used to combat some postcursor ISI, 
although decision feedback is being used. The idea is to reduce the 
possibility that large postcursor tails will be produced by a linear 
filter whose sole job would otherwise be to reduce precursor ISI. 

The system model we choose to work with is a sampled data or 
discrete one. In addition, the channel and the system’s transmitting 
and receiving filters are assumed to be of finite nonrecursive type. 
Examples are discussed in a later section which involve voice-grade 
toll telephone channel spectra. These spectra have been reduced to a 
specified Nyquist equivalent bandwidth, both for baseband and 
passband applications. The timing involved in going from continuous 
waveforms to sampled data for these examples has been chosen to 
maximize a signal-to-noise criterion before any filtering is done at the 
receiver. Also, in the demodulation process for QAM, the carrier 
phase angle, if fixed, can be absorbed by the receiver’s passband filter 
taps. (For more detail on the system model we use in the following 
sections, see Appendixes A and B.) 
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This paper follows two previous documents!” that have dealt with 
asymptotic performance results concerning decision feedback equaliza~- 
tion. In these previous works, the filters assumed in the decision 
feedback equalization scheme were of infinite length. In contrast, we 
focus our attention here on designing filters of finite, implementable 
length and study a channel which is modeled from transmission data 
taken from the 1969-70 Toll Connection Survey of the Bell System. 


Il. TRANSMITTER AND RECEIVER FILTER DESIGN (BASEBAND) 


We begin by referring to Fig. 1 and denoting the channel responset 
by {h,}o’. We are seeking nonrecursive filter tap weights {a,}j’ and 
{b,}o, N «< M at the transmitter and receiver, respectively. We note 
the total response through the system is then 


{rnparPrn tt = fan}o{hn}d {bn}. (1) 


where *« denotes sequence convolution. 
If we decide to sample at time 7 and cancel? r,, k > 7 through 
decision feedback, then we can define a signal-to-noise ratio 
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p(N, T, a, b)A o°||b||? =e Dy re (2) 
<r 





representing the sampled signal in the numerator and two noise terms 
in the denominator. The first noise term consists of the ambient noise 
which is modified by the receiving filter. [We write b for (bo, b1, ---, Bn) 
in HX*! Euclidean space with (a, b) as the usual inner product and 
||b||2 = (b, b) the usual norm.] We have assumed that the noise 
samples are independent and of generalized variance§ o? and that the 
input binary stream of symbols is independently and fairly signed and 
of unit magnitude. The second denominator term is a measure of the 
precursor ISI. 


2.1 Filter design by integral adjustment 


If we assume that the transmitter filter is to be optimized indepen- 
dently from the receiver filter we are then concerned with the 


T Appendix A explains our use of the sampled response {hn}. We suppress the 
constant multiplier 1/7 which converts the 27! coefficients to time samples (where 
T is the time between samples). 

* We choose to cancel all postcursors. In practice, only a few are cancelled and 
others then become part of ISI term in (2). 

§ By generalized variance we imply that a constant multiplies the true noise sample 
variance. This constant takes into account the sampling speed at which we are 
measuring the signal-to-noise ratio. 
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response : 
{gna TY = [an}de{hn for (3) 


Define a = (do, a1, ---, @v) and hy = (hx, Ae-i, +--+, he_w). We seek 
the maximum of 
di (h,, a)? 
o(N, Ty a) eee ae > (hi, ay? (4) 
k<r 

subject to the constraint that ||a|/? = u?. That is, we place an average 
power constraint on the transmitter. Hence, by constructing the 
quadratic form induced® by the sum of bilinear forms (hy, a)?, we 
reshape p(N, 7, a) into 

(hr, a)? m 
uw *(a, oTa) + (a, Qa)’ 
where J is the (V +1) X (N +1) identity matrix and Q is the 
(N +1) xX (N+ 1) positive semidefinite matrix ( Cres Ae_she-j) 


057,75 N with h, =0,1>0. By use of the Cauchy-Schwartz 
inequality we find readily that the maximum of p(N,7,a) is achieved at 


. . ebtel + Oh, 
a" = Tel + Oh, | (8) 


and the maximum is precisely 


max p(N, 7, a) = p(N, 7, a*) = (hy, (w%o° + Q) hs). (7) 


[fa] [212 


p(N, Ty a) = 


We note that the sequence (ho, hi, ---, h-) is mapped by the vector a* 
into a sequence (go, Ji, °**, g;) Which the receiver is now expected to 
process in forming the following signal-to-noise ratio: 


(g;, b) 
ie oar 1 er — a ER aa CES 8 
o(N, T, a , b) o*||b|| fh y (gi, b)?’ ( ) 


where 
i = (Gi Ge-1, ** +, Gr-w)- 
Since p(N, 7, a*, b) is invariant to any scaling of b, we choose to 


maximize the former with respect to ||b|| = 1. By the same argument 
which led us to (6) and (7), we find 


[ol + R}'g; 


bt = 
[Lor + RY %gz| 


(9) 
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and 


[ol + R]"g;), (10) 





Beet S. 1 . 
p(N, T,a »b ) pa \|[Lo2T + Rye] 8 
where R is the (VN +1) X (N +1) matrix (rer gi-i9i-), 0 S 3, 
7 < N. Hence, p(N, 7, a*, b*) in (10) represents the maximum signal- 
to-noise ratio achievable through the integral or independent adjust- 
ment of transmitter and receiver filters for a decision feedback system 
committed to sampling at time 7 and constrained to use nonrecursive 
linear filters of length N + 1. 

The difference between linear equalization and decision feedback 
can be seen readily by observing the denominator terms of the follow- 
ing signal-to-noise ratio: 

GN satel) eon OP a, GH) 
ee o°||b||? ae sy (Sx, b)? = >= (Sx, b)? 
k<r k>r 

For decision feedback systems, the last term in the denominator 
does not enter the picture because it is assumed it will be eliminated 
without noise penalty. However, in linear equalization, the filter b 
is expected not only to combat precursor ISI but postcursor ISI as 
well, with as little compromise to ambient noise as possible. We can 
rewrite (11) by assuming ||b||? = 1 (ie., scaling irrelevant) 


eect otaee! BEDI = 2 = 
e(N, T, a, b) = (ol ay eee R:)b, b]’ (12) 
where Ri and Rz are, as usual, positive semidefinite channel response 
autocorrelation matrices. Here R; corresponds to precursor distortion 
while R» relates to postcursor ISI. We notice that, if we form for 
Osaxl 
2 (g,, b)? 
pal, 7, 8,8) = TATE R, + ak)b, Bl’ a 
we can continuously vary pe(N, 7, a,b) from the decision feedback 
formulation where a = 0 to the linear equalization case where a = 1. 
Thus, although we implement decision feedback equalization, it is 
possible to design the transmitting and receiving filters so that the 
amount of postcursor distortion is still mildly to strongly influential. 
Of course, a more general formulation of this “hybrid” design tech- 
nique is possible by retracing our steps back to (11) and forming 


- (g,, b)? 
PADS o°||b||? + Dv (Ax gx, b)? + x (Ax gx, db)” 4) 
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where A, are (VN +1) X (N+ 1) diagonal matrices (obviously 
A, = I for the linear equalizer case).' 


2.2 Joint optimization of transmitter and filter design (baseband) 


In the individual design of transmitter and receiver filters treated 
in the last section, we were able to find the optimal filters by a simple 
rearrangement of interference terms and applying the Cauchy- 
Schwartz inequality. We find that for joint filter optimization this 
procedure will be slightly modified and additional steps will be taken 
to arrive at the solution. 

We recall that the total response of the system depicted in Fig. 1 is 


{rapa PY = (an}o {hn} die (dn hd (15) 
and the signal-to-noise ratio: 


2 
o(N, 7, a, b) = CEE 


a*lfbl + > te, Ba)? ie 





where c is the 2N + 1 dimensional vector formed from the sequence 
fan}o*{bn}y and hy, = (hx, hea, +++, Ae-2w). Here again, p (N, 7, a, b) 
is seen to be a continuous function of a and b and functionally in- 
variant to the norm of b. Hence, we constrain our search for the 
optimal b vector by imposing ||b|| = 1. 

The transmitter power constraint was imposed in Section 2.1 by 
lal] = »?. In practical situations, the constraint is more likely to be 
lal] < p?. That is, we want to use only enough power to yield a suffi- 
ciently high signal-to-noise ratio at the receiver. For example, we 
constrain the receiver filter to be of unit norm since the norm is not 
going to contribute toward the enhancement of the signal-to-noise 
ratio at its output. Rather, it will be the transmitter filter power 
output which determines the output signal-to-noise ratio to a large 
extent. A way of solving the joint filter optimization problem with 
constraints, then, is by permitting the transmitter power level to be 
at that as-yet undetermined level so that the signal power through the 
transmitter, channel, and receiver will be at a prespecified ratio to 
that of the ambient noise. Hence, we have the following optimization 
problem: 





(axb, h,)? 
N b) = ; , 17 
jeter pam) eee =, o + > (axb, h,)? (17) 
[bl] =1 [bl] =1 k<r 


T Of course, some constraint must be put on A, to make the maximization of p 
meaningful. 
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Proceeding as before, we obtain 


2 —1 
(axb)* = kel HTH + R) h,, (18) 
(where ko is determined from the constraint ||h*a*b|| = 7) with 
2 —1 
o(N, 7, a*, b*) = (a, (HTH 4. R) h, | (19) 


where H is the 2M + 1 XK 2N + 1 matrix such that H(a*xb) = hxax*b 
where h = (ho, fi, he, +++, har). The matrix R is formed from the 


> Ax_ishe_; terms. Now (18) can be written in its 2— transfer function 
k<r 
representation. 


(axb)* (271) KL + ayer! + age? +--+ + aeye7?%) 
A()BEA) = kQi)Q)-- Qu, 


where k is a determined constant and the Q’s are quadratic factors 
with real coefficients. A choice of the quadratic factors for composing 
A(z) and B(z-) exists. However, since ||b*|| = 1 we are then left with 
a determinable norm for a*. For example, we might choose 


(20) 


* — Q1(z7+) 
B*(z) = Tae)" (21) 
Hence, 
A*(e) = |1Q1(7)|]-&Q2(e7)- > -Qa(e7), (22) 
with norm 


|A* 2) | = lls 2) [1]]Q2@) - Qn eI. 


Regardless of how the quadratic factors are assigned, B*(z') is 
normalized and A*(z-) is then left with some norm value which may 
be large or small. The total norm ||a*h«b||, however, was chosen to be 
n and for each receiver filter chosen from the quadratic factors of (20), 
a corresponding ||A*(z-)|| results. It is of definite engineering interest 
to seek that quadratic factor combination which minimizes ||A*(z~) ||, 
but no obvious solution exists for this combinatorial problem. Other 
considerations may come into play at this point which would obviate 
the need for minimizing ||A*(z-)||. For example, a minimum phase 
requirement for one of the two filters would delineate the two filters. 
Roundoff noise considerations for digital filter implementations might 
also contribute toward selecting one quadratic factor over another at 
the receiver. Cost considerations may warrant the splitting of the 
two filters into equal lengths (N even) so that the number of possible 
quadratic combinations is reduced considerably. In any case, this 
filter-splitting problem is akin to the quadratic factor placement 
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problem in minimizing roundoff noise in digital filter implementations. 
In Appendix D we outline a technique for separating the transmitter 
and receiver filters. 

Of course, it is possible to go through the same generalization on 
postcursor and precursor equalization that we did for the integral 
optimization problems of Section II. We obtain in that case 


2 —1 
o(N, 7, a*, b*) = |. (SHH + Rit as) h, | , (28) 
where 
ie kof % HH Rip alta) “b, (24) 


and where Ri(R2) is the matrix corresponding to precursor (post- 
cursor) interference terms and 0 Sa <1. 


Ill. PASSBAND FORMULATION 


It is possible to extend the results outlined in the previous sections 
to the passband equivalents of transmitter, channel, and receiver for 
a quadrature amplitude modulation (QAM) system.t The extension 
of results is not without complications, since QAM systems suffer 
from another form of distortion—co-channel interference (CCI). 
Thus, the transmitting and receiving filters will be expected to combat 
not only ambient noise and ISI but also co-channel intersymbol 
interference (CCISI). 


3.1 Integral optimization 


We begin by referring to Fig. 2 which illustrates the QAM system 
with decision feedback. We are interested in the transmitter and 
receiver filter designs so that a measure of transmission performance 
is maximized. Namely, we seek to maximize a sampled signal-to- 
generalized-noise ratio similar to that defined in (2). To define the 
terms which will appear in our performance measure, we note that 
the “in-phase” response at the receiver is 


{ (P| MeN = {aP Nal [hO }MaelbPyy — eater a 
— fax? foal {hi ord fal + {[hP foto yo], (25) 
while the ‘“‘quadrature’’ response at the receiver is 
(PSP SEF — (al 4a (P48 ( BL 1ST (ASSL DP HTT 
+ fa? foal {hi yor (bP fo — {hi yor{be? Jo}. (26) 
T We will not concern ourselves with the problems of carrier acquisition and timing 


for the QAM system we consider here in discrete form (see Appendix A for a discus- 
sion of these items). 
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Fig. 2—QAM data system with decision feedback equalization. 


Each channel response has two ISI components, an in-channel inter- 
symbol interference term and the other due to co-channel interference. 
We notice that the CCI is completely eliminated if {b{} = {h{?} 
and {b?} = {hf}. However, in our considerations we will always 
assume M > WN so that our filters do not have a sufficient number of 
degrees of freedom to eliminate CCI (also, this action does constitute 
suboptimal filtering). 

We form the in-phase resultant signal-to-noise ratio for independent 
input channels, independent symbols of unit magnitude with equal 
chance of occurrence and uncorrelated noise samples of variance a’. 
We first treat the case where the receiver filter is all pass (1.e., b = e, 
the identity vector in the algebra of convolution) 


[ia@ he) se be)? 
a” 1 >» [(a®), hj”) es (a‘@, hy”) ape eh 
k<r 
+E La, bY) + @, h)P. (27) 


kr 








p:(N, T, a, b) = 


Now we define 
a= [a®, a‘@ | 
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and h, = [(h”, —h] and h? = [h®,h® ], vectors of 2N + 2 unit 
length. Hence, we can rewrite (27) into 


(a, h,)? 


mee) = FES Gb +S @ BE 


(28) 


In (27) and (28) we have tacitly assumed that at the receiver each 
channel will ‘‘talk” to the other for the purpose of cancelling post- 
cursor CCISI also.t That is, we have assumed a dual system of decision 
feedback equalization is being implemented. 

The maximization of p,(N, 7, a, b) subject to b = e and |lal|? = 2 
leads to a solution similar to that of (6) and (7): 


ulu’o2XT + Q + Q.1h, 


x 
a" = Tel +0 + 0.1] oe 
Snes e(N, T, a, b) = p(N, Ty a*, e) 
bee” 
= u(h,, (uo + Q + Q.)h,) (30) 


le’ + Q4+ QF bl’ 


where Q and Q, are respectively the in-phase and co-channel correlation 
matrices similarly formed, as was the Q matrix of (6). The a* vector 
of (29) separates into a*”) and a*“ and the conditionally optimal 
transmitter bandpass filter is completely specified. Following the 
procedure in Section II, we now hold the transmitter design fixed 
at a* and rewrite (25) as 


{re yor’ | a maw = (DEP $a { fan Fd (A 30} 
— {d4 fore { {ae J {hk 304} 
— {DEP} { {an JO (LP 30%} 
{BP Sof far Jo*{AP Jo} (31) 
= {BP Yo GPP YOO” — (ghar) 
— {dE Jo { (gk? Jorr™ + {oP} F"}, (82) 


where {g{?}¥**, u = pp, pa, ap, aq are recognizable from (31). 
We can now write the expression for p(N, 7, a*, b) as 
: = [(b™, g) — (b@, gp) 
PN na) = aby [b®, a) — O°, e+ 
k<r 
+ 2D L(b™, gi”) + (b®, gi?) P, (33) 


kst 


T Also, we are assuming we will eliminate all postcursor ISI. However, in practice, 
only a few postcursors would be removed. Thus, some postcursor terms would appear 
in the denominator of (28) in that case. 
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where 





ge = (97? — gi, grt — gi, ---, gPtN — gry), (34) 
and similarly for g. To maximize p(N, 7, a*, b) subject to ||b|| = 1, 
we first form the concatenated vectors of 2N + 2 length: 
b = (b?, b’), ge = (g”, —8i”), Bt = (gh, Bt”). (35) 
Hence 
2 
p(N, 7, a,b) = (P, Be) (36) 


o*||b||? + > (b, gx)? + pz (b, gg)?’ 
and we proceed to find that 
max p(N, 7, a*,b) =[g,, (o°l + k + R.)'g,] (37) 


[|b {=1 


achieved at 
b* = (Pl +R+ R.) 1g, 
(ol + R + R.)—g,||’ 


where #& and R, are channel response correlation matrices of the type 
encountered before. 


(38) 


3.2 Joint optimization 


To jointly optimize the transmitter and receiver passband filters, we 
follow virtually the same procedure found successful for the baseband 
case. A comparable factorization problem arises here, for which only a 
combinatorial solution seems to exist. 

The in-phase and quadrature responses through a passband trans- 
mitter, channel, and receiver are given by 


{re Jor te” = {he for (Las? Jo* {Dk Jo — {ak fo’ {bY 0) 
— {hi Jor (ae? Jo'*{i fo + fax? fo* {be Jo) (89) 


{rPJorPPN = {APJ (La fo'e(D }T + alm a {b§” 30) 
+ {A Va ({aP Nab — {al}Nalb}X). (40) 


Rewriting (39) and (40) in terms of a coms passband filter with 


responses c”) = {cf?}§" and c(M = a 
{pP \aeten = {AP } Melo} 2 <aele 2N (41) 
{r{?} aS {AP yore {oh }2 2N + {AA Yah { of? rags (42) 


we form the augmented vectors c = [c‘, c) ], 


h; = a (hi, h®,,, ee, heron, —hf, tee —hAy on) 
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and 


= (HP, WP, ++ Raw, WP, ++, Paw), AY = 0,k < 0,0 = pyg 
Our signal-to-noise ratio becomes for the in-phase channel: 
h,)? 
p(N, 7, €) = iC, Be) (43) 


o*||b||? + (Bc, c) + (Rc, c)’ 


where & and R, are the now-familiar channel correlation matrices and 

= (b®), b‘2). It is easy to show that the norm of the receiver filter 
is irrelevant in the maximization of p(N, 7,c). Hence, we choose 
[b|| = 1. We now specify the amount of signal power 7? we will need 
at the receiver upon choosing the optimal filters. That is, 


M+2N 
¥ | Gos, ©)[? + | (8, 6) |? = of (44) 
But (44) can be rewritten 
oe) 1, (45) 


where Q is a sum of two correlation matrices. Hence, (438) then yields 
the problem: 


is ss (46) 
dk py “1, aren + (Re, c) + (Rc, ¢) 
to which the solution is 

= i(5 oo 4+R+R.) h, (47) 


and 
Repeat ee EE (2 eee Re) hy. 


The constant k is determined from the constraint that (Qc, c) = 7’. 
Since c = (c‘,c)) and the vectors (a), a‘) and (b“), b“) all 
make up c) and c“, we encounter a factorization problem. We can 
choose (b), b‘2)), normalize the receiver filter, and then are left with 
the transmitter filter which has a given norm. This norm is then the 
transmitter power required to produce 7?/o? generalized signal-to- 
noise power at the receiver. 


IV. EXAMPLES 


To illustrate the difference in performance between decision feedback 
and linear equalization, we have taken a telephone DDD toll connec- 
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tion as a linear channel model. Specifically, we would like to know the 
difference in performance on a telephone connection when both linear 
and decision feedback equalization schemes are constrained to use a 
finite number of taps. For comparison, we compute the performance 
asymptotes (infinite number of taps) for each equalization scheme 
(see Appendix C) realized when an infinite number of taps are available. 
We ask whether it is possible to approach these asymptotes with a 
reasonable (implementable) number of taps. Another point which is 
raised in every implementation of decision feedback equalization is 
that of postcursor size. If a mistake in symbol identification is made, 
then the subtraction, for example, of an erroneously signed postcursor 
may lead to a burst of errors if the postcursor size is large. We illustrate 
the postcursor sizes for a passband decision feedback equalization 
system operating on an average telephone connection. 

Figure 3 illustrates the magnitude characteristic of the average 
DDD toll telephone connection as measured in the 1969-70 Toll 
Connection Survey of the Bell System. The corresponding delay 
characteristic follows a parabolic shape and has been numerically 
integrated to yield a phase curve. As discussed in Appendix B, the 
bandpass channel parameters have been calculated for various carriers 
and various flat Nyquist spectral widths assumed at the transmitter. 
The spectral width was controlled by superimposing a cosine rolloff 
(400-Hz width centered at the Nyquist frequency) on the in-phase 
and quadrature spectra. Figures 4 and 5 show typical passband spectra 
computed for this channel. When this decomposition of the bandpass 
channel into in-phase and quadrature responses is achieved, it is 
possible to compute the performance asymptotes for linear and decision 
feedback equalization given in Appendix C. The result of these compu- 
tations is shown in Table I. It is seen that performance decreases 


0 


—20 


LOSS IN DECIBELS 


800 1600 2400 3200 4000 
FREQUENCY IN HERTZ 


Fig. 3—Average amplitude characteristic for toll telephone connection (from 
1969-70 Toll Connection Survey data). 
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Fig. 4—Quadrature amplitude characteristic for average toll telephone connection. 


with data rate and slightly with increased carrier frequency. The gap 
between decision feedback and linear equalization widens as speed 
is increased. For all computations, we have kept the total transmitted 
power through the channel fixed at — 12 dBm, whereas the noise power 
spectral density was kept at that level corresponding to total noise 
power of —48.3 dBm over a 0-3000 Hz bandwidth. This noise level 
is 3 dB weakert than the average noise power measured in the 1969-70 
Toll Connection Survey. 

A finite length receiving filter was increased in length until perform- 
ance was reasonably close to the asymptote given in Table I for that 
speed and carrier. Figure 6 illustrates the difference in performance 
between decision feedback and linear equalization. It is seen that less 
than half the number of taps are required by the decision feedback 
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Fig. 5—In-phase amplitude characteristic for average toll telephone connection. 


T Noise level was made weaker only for computational convenience. 
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Table |— QAM transmission asymptotic SNR in dB for average 
toll telephone connection 
Noise at receiver — 48.3 dBm; transmitted power—12 dBm 





Carrier = 1650 Hz Carrier = 1700 Hz Carrier = 1800 Hz 








L.E. | D.F. | M.F. | LE. | D.F. | M.F. | L.E. | D.F. | M.F. 
2400 31.4 | 31.9 | 32.1 | 30.7 | 31.6 | 31.9 | 29.2 | 30.9 | 31.6 



































L.E. = Linear equalization asymptote. D.F. = Decision feedback asymptote. 
M.F. = Matched filter bound. 


equalizer to achieve a level of performance close to the asymptote. 
In addition, the linear equalizer even with its 36 tap length per channel 
could not keep an acceptable performance level when the data speed 
was increased to 3200 symbols/s/channel. On this basis, the premise 
that decision feedback equalization has significant advantages over 
linear equalization may be too readily accepted. For, if we examine 
postcursor sizes on one of these equalized bandpass channels, we can 
see that the high signal-to-noise ratio offered by decision feedback 
does not come without penalty. Figure 7 illustrates sample sizes of a 
toll telephone channel equalized with a 16-tap (8-feedback) decision 
feedback equalizer. The precursors, or samples before the main sample 
peak, are too small to be seen on this scale. However, it is clear that 
the postcursor adjacent to the signal sample, which is greater than 
half the latter’s size, presents a problem. Should a decision error occur, 
the next signal sample could have its polarity reversed, since more 
than twice its strength could be subtracted out by the decision feed- 
back processor. Thus, error propagation is possible with only a single 
mistake providing the ignition. Let us recall the hybrid equalization 
scheme discussed in Section II. We note in Fig. 8 that, for an alpha 
value of 0.01, we diminish the size of the large postcursor and more 
evenly distribute the heights of all the postcursors to be subtracted 
by the decision feedback processor. It is now apparent that no one 
postcursor is large enough to reverse the polarity of the signal should 
a decision error occur. It will take several consecutive decision errors, 
for example, before this can happen now. However, we lose 1 dB in 
signal-to-noise ratio for this example when we opt for this mitigation 
of the postcursor size problem. Of course, a trade-off exists between 
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Fig. 6—Performance of finite equalizers for average toll telephone connection. 


the loss in signal-to-noise ratio and reduction of postcursor size by 
means of this method. 


V. SUMMARY 


We have treated the design of finite length transmitting and 
receiving filters for a data system employing decision feedback 
equalization. Our purpose here was to examine the difference in 
performance between linear and decision feedback equalization on 
a given data channel. Sequential and joint optimization of trans- 
mitting and receiving filters were treated for an all-Nyquist equivalent 
data system. Although the solutions for the optimum tap settings 
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Fig. 7—Postcursor size problem and mitigation. 


and signal-to-noise ratio were derived in general terms, applying the 
results to the spectrum of a toll telephone connection was of special 
interest. For this channel example, it was found that fewer filter taps 
were required for decision feedback equalization to achieve a reason- 
able performance level. The problem of postcursor size for an overall 
response of a passband decision feedback equalized system can be 
mitigated by a hybrid equalization scheme. The price for allowing the 
linear filter taps to diminish the postcursor sizes in this hybrid equalizer 
is a lower signal-to-noise ratio. 


APPENDIX A 
Details about the discrete channel model 


The lowpass filters in the A/D or D/A conversion process shown in 
Figs. 1 and 2 delimit the channel frequency band which supports data 
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Fig. 8—Total impulse response hybrid equalization. 
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transmission. Hence, the channel can be seen as a bandlimited medium 
and can also be reduced to discrete form for M sufficiently large: 


A(f)=L=hemnr [ps 


nT 


M 
=0 


where 7’ is the data symbol interval or 1/2T is the Nyquist frequency. 
The point here is that {h,} is dependent on the timing chosen for 
this reduction. Obviously, a timing exists which maximizes a signal- 
to-noise ratio, for example, of the unequalized response. We have 
found by experimentation that this timing was an excellent approxi- 
mation to the timing which leads to a maximum signal-to-noise ratio 
after equalization. 

For a bandpass channel, the decomposition into discrete form takes 
place in two steps. First, a carrier frequency is chosen, and in-phase 
and quadrature spectra are then computed. A constant carrier phase 
is then a variable parameter. However, it is easily shown that this 
carrier constant can be absorbed by either the demodulation process 
or the passband equalizer tap settings. 

It is important to recall that the time samples of the spectrum 


M 
H(f) =X hy emimeit 
n=0 


1 , ; ; ; : 
are Lahn jo" . Hence, in the formation of the signal-to-noise ratio: 


he 
p* eT? + ISI 


we form the generalized variance parameter o?T? where o? is the noise 
sample variance. This accounts for this transformation from Fourier 
coefficients {h,}§‘ to time samples {1/7'-h,}2d’. 


APPENDIX B 
Channel data from 1969-70 toll connection survey 


The average loss and delay measurements of over 600 toll voice- 
grade connections made in a 1969-70 survey are recorded in Ref. 4. 
For our channel model, interpolative curves were constructed from 
the average survey measurements made on 20 frequencies. A linear 
loss slope was appended at the lower frequency end to extrapolate loss 
down to zero frequency. The slope of the loss curve in decibels at the 
lowest measurement frequency (200 Hz) was used for this extrapola- 
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tion. A constant was added to the integrated delay curve to achieve 
zero phase at zero frequency. 

Passband responses at several carrier frequencies were then formu- 
lated from the interpolated baseband data. An impulse response was 
calculated for each in-phase and quadrature channel. Timing for the 
two channels was chosen to maximize the squared sampled signal to 
mean square ISI and CCISI before the receiver filter. One hundred 
eight Nyquist samples ({h}}%) and {h{?}3’) represented each 
passband channel. 


APPENDIX C 
Asymptotic MSE as derived by Falconer-Foschini * 

We list here the formulas for the MSE as achieved by linear equali- 
zation and decision feedback for passband systems (here, independent 
binary +1 transmission is assumed with No/2 input noise spectrum). 





(MMSE) tinear = ee 7( =a) =f: 1) af (48) 
(MMSE)a: = exp |r ee log ( Ae + 1) at} , (49) 
where 
run = Ear §) 40+ 8) 





2 


x 





n n 
c( f+ 7) +i0(t + p) 
The passband transmitter and channel characteristics are denoted 


by Gi+ jG. and Ci + C2, respectively. For comparison purposes, 
it is simple to show that the matched filter bound is 


(MMSE) mr = {1 ee (42 “ Lat}. (50) 





It is of interest to note that we can prove that expressions (48), (49), 
and (50) follow the sequence 


(48) S (49) S (50) 


by invoking Jensen’s Inequality for the logarithm as the concave 
function. It is clear that, for the ideal channel and transmitter, i.e., 
Xo(f) = 1/T, we have 

(MMSE) tines = (MMSE)ae = (MMSE)uns = ——5 


1 + (NoT)7 
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APPENDIX D 
A technique for separating transmitter and receiver filters 


We wish to determine that factorization of 
N 
A(z")B(z7) = TT Q, (27), 
n=0 


which minimizes ||A (z—)|| while ||B(e—})|| = 1. Each 
Qn(e4) = 1+ afPet + ae? 


is a quadratic factor. We assume no real roots occur, although the 
extension of the technique we present here to include real roots is 
obvious. Now 


|Q,.(2-) |? = [lente Pag = 1+ (af)? + (af). 


We notice that, upon choosing B(z) = [J] Qn,(z") (where 
mCNeB 
NrUNa = {0, 1, 2,---, N}), then 


IAG Y= TL Qn@ OI TL Onn]. (51) 
nkENB nm GNa 


Thus, what we really want to do is select a partition of the Q; factors 
so that the product of the norms of the partition factors is minimized. 
Much like the quadratic factor partitioning problem in digital filter 
implementation for minimizing roundoff noise, the only method for 
obtaining the global minimum of ||A (z~)|| seems to be the formation 
of all possible combinations of quadratic factors. When WN is large, 
say, 20, this combinatorial method is time-consuming even when the 
filters are forced to be of the same order. 

A technique for constructing the partition which sequentially 
minimizes ||A (z—)|| is first begun by reordering the quadratic factors by 
norm {Qni}{o. We think of the two norms of (51) as bins, and we 
sequentially fill those bins with quadratic factors. We insert one of 
two quadratic factors of largest norms into the first bin and the second 
factor into that same bin. We evaluate the norm of the first bin and 
now compare it to the product of the norms of the individual factors. 
Whichever placement results in smaller norm product, we choose as 
our partition initialization. Thus, at the end of the first step we have 
either 


bin 1 bin 2 
I]QnQnall || 
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or 
binl bin 2 


Qnall HQ nall, 


depending on which product is smaller. The next factor, Q,,, is brought 
into the current partition and the products again are tested as to 
whether Q,, minimizes the product when placed in bin 1 or bin 2. 
The process continues until all quadratic factors are placed into 
elther bin. 

This procedure has been programmed and tested on actual filter 
quadratic factors. It has been our experience that the resulting factori- 
zation was close to the optimal one. To cite an example: Ten quadratic 
factors were randomly placed into two bins 500 times. The product 
of the norms of the two bins’ contents ranged from 0.584 to 1183.33. 
The partition which our procedure yields for this set of quadratic 
factors had the product value of 0.646. Only 36 of the 500 partitions 
yielded smaller products. But little could be gained by using any of 
these 36 partitions. However, the worst partition was four orders of 
magnitude away from the outcome of our procedure. This is possibly 
what is most important, namely finding a partition very far away 
from the worst one. 
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Some Properties of the Erlang Loss Function 


By D. L. JAGERMAN 
(Manuscript received March 5, 1973) 


This paper develops the properties of the Erlang loss function, B(N, a), 
used in telephone traffic engineering. The extension to a transcendental 
function of two complex variables is constructed, thus permitting the 
methods of complex analysis to be employed for the further study of its 
properties. Hxact representations, Rodrigues formulas, and addition 
theorems are given both for the loss function and for the related Poisson- 
Charlier polynomials. Asymptotic formulas and approximations are 
developed for the loss function and also for its derivatives. A table of 
coefficients 1s included which, together with one of the asymptotic formulas, 
permits computation of B(N, a) by simple means even when the number 
of trunks, N, is very large. This same table is used to obtain 0B(a, a)/dzx. 


I. INTRODUCTION 
The Erlang loss function 


P= mn) Sai (1) 


is fundamental to the study of telephone trunking problems. A. K. 
Erlang! used B(N, a) to express the probability that a call, which is a 
member of a Poisson stream of parameter a, arriving at a group of N 
telephone trunks will be rejected. Later studies of trunking problems 
have shown the desirability of enlarging the scope of applications of 
the loss function. For example, the consideration of trunk groups with 
nonintegral number of trunks arises in determining the equivalent 
number of trunks in Wilkinson’s ‘‘equivalent random method.” ? 
Methods for accomplishing the computation by interpolation are given 
by Rapp?’ while continued fraction procedures for accurate computa- 
tion are given by Levy-Soussan‘ and Burke.® Derivatives with respect 
to N and a arise in optimal trunk group size apportionment problems. 
See, for example, Akimaru and Nishimura®’? who studied such models 
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and prepared tables of derivatives. In some investigations, rapid and 
accurate approximate computations of B(N, a) for very large trunk 
groups are needed. This occurred in the study of certain satellite 
telephonic communication problems.®:* The need thus arises of en- 
larging the definition of B(N, a) as given in (1). Of course, that is done 
implicitly in the above investigations. It has been customary to extend 
the definition of B(N, a) by use of an integral formula (Theorem 3) 
ascribed to Fortet. This integral formula is used in (23) to define a 
transcendent, B(z, a), for complex z and a. The extension to the complex 
plane in both z and a permits the powerful methods of complex analysis 
to be applied for obtaining exact, asymptotic, and approximate 
representations. 

It is the purpose of this paper to provide an investigation into the 
properties of B(z, a) with the object of generalizing known results, 
obtaining new results, and presenting practical methods for application 
to the class of problems outlined above. 

Part II derives exact relations satisfied by B(z, a). Similar relations 
for the related Poisson-Charlier polynomials, G;(z, a), are derived in 
the appendix. These relations provide efficient means for exact com- 
putation; thus, Theorems 1 and 2 constitute a practical method of 
computing B(N, a) to a prescribed accuracy for isolated computa- 
tions. Similarly, the use of Theorem 5 enables one to compute B(z, a) 
even for nonintegral number of trunks. Theorem 6 may be similarly 
employed. The relationship of B(z, a) and G;(z, a) to Whittaker func- 
tions as given in Theorems 7 and 24 is the key for linking up these 
functions with the more well-known functions of applied mathematics, 
i.e., hypergeometric functions and Laguerre polynomials. The Rod- 
rigues Theorems 8 and 22 are useful for the evaluation of integrals 
of the form 


/ Ff (a)ae-*B(N, a)—da, J f(ajare"G;(z, a)da (2) 


and, as in the case of Theorem 22, for obtaining an integral representa- 
tion. The addition Theorems 9, 10, 26, and 27 yield group-theoretic 
structure information which is useful for simplifying formulas contain- 
ing these functions, and for the evaluation of integrals. The evaluation 
of an integral, by means of generating functions and Theorem 10, was 
done in Part IV to obtain ultimately an approximate formula for 
0B(x, a)/dx. A general use of the exact relations is to serve as a spring- 
board for asymptotic and approximate results and also for their error 
estimations. This is well illustrated in Part III of the paper. 


526 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1974 


The asymptotic expansions of Part III are also representations of 
B(z, a) but, unlike those of Part II, when used as approximate formulas 
for computation they cannot yield results of arbitrarily high accuracy, 
i.e., the accuracy depends on specific values of parameters. Theorem 
11 is particularly useful for computation when |2/a| is small. It may 
be used for the computation of B(z,a) for fractional number of trunks 
by computing B(z, a) for 0 < z <1 and then using the recurrence 
formula of Theorem 4. Theorem 11 includes well-known asymptotic 
results, e.g., 


1 —1 
B(— 5,4) 


Vrae* “(L — erf Va) ~~ 1 — Sat + a 





eli 

Sgt te, ae, (8) 
B(—1, a) = — ae*E;(—a) ~ 1 — a'4+ 2!a”? 

= sla? +s", a> ow, 


An undesirable feature of many methods of computing B(z, a) is the 
dependence of the computational effort, e.g., time of computation, on 
the value of x; thus, the larger the value of x the greater the computa- 
tional effort. Theorem 14 overcomes this defect; the computational 
effort is independent of the size of x. Theorem 14 is easily usable even 
with a desk machine regardless of how large x is. The accuracy, how- 
ever, depends on z and a parameter c. For fixed c the accuracy improves 
with increasing x. When z is fixed, the accuracy deteriorates when c 
is large and negative but greatly improves as ¢c is increased. To facilitate 
the use of Theorem 14, Table I gives required coefficients, namely, 


ao(c), a1(c), a2(c). To use the table, one computes 
a—x 
c= , 4 
ie es 





then 





B(a, a)! & ao(c)Vx + ax(c) + oe (5) 
Possibly, one should comment that the range of values of xz, ¢ for 
which (5) is accurate is not as important as the fact that it is accurate | 
over a wide range of values of B(x, a), that is, values encompassing 
the ranges of most applications. For quantitative limitations, see Fig. 
1. A method of obtaining dB(z, a)/dx based on Theorem 14 is given 
in Part IV. This uses the formula 
dB(x, a) _ Bia, a)? ae 
dx ON (0 - ] 
ag aa 12 
2x 





B(a, a) {2-14 BG a)+. (6) 
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Table |— Coefficients for evaluation of B(x, a) and dB(x, a)/ax 


c* ao ay a2 Cc ao ay Q2 
—3.0 225.3 2032 13726 0.6 0.8230 0.7274 | 0.1011 
—2.9 167.7 1367 8536 0.7 0.7749 0.7414 | 0.0985 
—2.8 126.0 925.4 5334 0.8 0.7313 0.7552 | 0.0954 
—2.7 95.63 630.5 3348 0.9 0.6917 0.7686 | 0.0920 
—2.6 73.28 432.2 2111 1.0 0.6557 0.7814 | 0.0883 
—2.5 56.70 298.0 1336 1.1 0.6227 0.79387 { 0.0845 
—2.4 44.29 206.7 848.1 1.2 0.5926 | 0.8053 | 0.0806 
—2.3 34.92 144.1 540.2 1.3 0.5649 | 0.8163 | 0.0767 
—2.2 27.80 100.9 345.0 1.4 0.5394 | 0.8267 | 0.0729 
—2.1 22.33 71.07 220.7 1.5 0.5158 | 0.8864 | 0.0691 
—2.0 18.10 50.27 141.4 1.6 0.4940 0.8455 | 0.0654 
—1.9 14.80 35.71 90.70 1.7 0.4739 0.8540 | 0.0619 
—1.8 12.21 25.49 58.17 1.8 0.4551 0.8619 0.0585 
—1.7 10.16 18.27 37.28 1.9 0.4876 | 0.8694 | 0.0552 
—1.6 8.521 13.15 23.86 2.0 0.4214 | 0.8763 | 0.0521 
—1.5 7.205 9.522 15.23 2.1 0.4062 0.8828 | 0.0492 
—1.4 6.139 6.936 9.692 2.2 0.3919 0.8889 | 0.0464 
—1.3 5.271 5.090 6.141 2.3 0.3786 0.8946 | 0.0438 
—1.2 4.557 3.772 3.872 2.4 0.3661 0.8999 | 0.0413 
—1.1 3.968 2.830 2.430 2.5 0.3543 0.9049 | 0.0390 
—1.0 3.477 2.159 1.519 2.6 0.3432 0.9095 | 0.0368 
—0.9 3.066 1.682 | 0.9486 2.7 0.3327 0.9139 | 0.0347 
—0.8 2.721 1.344 | 0.5960 2.8 0.3228 0.9179 | 0.0328 
—0.7 2.428 1.108 0.3816 2.9 0.3134 | 0.9218 | 0.0309 
—0.6 2.178 | 0.94385 0.2540 3.0 0.3046 0.9254 | 0.0292 
—0.5 1.964 | 0.8318 | 0.1804 3.1 0.2962 0.9287 0.0276 
—0.4 1.780 | 0.7580 | 0.1398 3.2 0.2882 0.9319 0.0261 
—0.3 1.620 ; 0.7112 0.1187 3.3 0.2806 0.9349 0.0247 
—0.2 1.481 0.6840 0.1089 3.4 0.2734 0.9377 0.0234 
—0.1 1.360 0.6705 0.1052 3.5 0.2666 | 0.9403 0.0222 

0 1.253 0.6667 0.1044 3.6 0.2600 0.9428 | 0.0210 

0.1 1.159 | 0.6696 | 0.1048 3.7 0.2538 0.9451 0.0199 

0.2 1.076 | 0.6771 0.1052 3.8 0.2478 0.9473 0.0189 

0.3 1.002 0.6877 0.1052 3.9 0.2421 0.9494 0.0179 

0.4 0.9357 0.7000 | 0.1045 4.0 0.2367 0.9514 } 0.0170 

0.5 0.8764 0.7135 0.1031 

a-—x 
Calculate c = ——— , then 
Va 


B(a, a) ~ aoVax + ai + a2/Vz, 


OB(z, a) Ba, a)? _@\ x+a {2 - }. 
ae os aes (a ; On B(a, a) : 1 + B(a, a) 








* Standardized offered load. 


It is appropriate to mention, at this point, another method of approxi- 
mating B(x, a) by means of a formula whose computational effort 
is also independent of x and which, similarly, is applicable over a wide 
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Fig. 1—Five-percent-error contour. 


range of values of B(z, a). This method is described in Refs. 10 and 11. 
A comparison of this method with that of Theorems 1 and 2 is given 
at length in a report by S. Miller.® 

Derivatives and inequalities on derivatives are given in Part IV. 
Theorem 15 extends the well-known derivative formula for B(x, a) 
with respect to the real variable a. Theorem 17 provides an accurate 
approximation for 0B (a, a)/dx. Empirically, the accuracy seems to hold 
to four significant figures or better over a very wide range of values 
of x and a. Of significance is the corollary which shows that the ap- 
proximate value obtained is always too small. If a quick appraisal of 
the derivative is desired, Theorem 18 may be used. The logarithmic 
convexity properties of B(x, a) given in Theorem 19 provide the useful 
bounds of the corollary on the second derivatives. Also an application 
is given to the logarithmic interpolation of Theorem 20. This is very 
useful when, for example, one wishes to compute B(x, a) for x be- 
tween consecutive integers, say N, N + 1, and for which B(N, a), 
B(N + 1, a) are known. An extension of this idea is provided by 
Theorem 21 which permits accurate computation of B(z, a). 

It may be remarked that generally relations, representations, and 
asymptotics for B(z, «)~! are simpler in structure than those for 
B(z, «) and may provide greater numerical accuracy in computations. 


ERLANG LOSS FUNCTION 529 


Il. CONVERGENT REPRESENTATIONS 


The study of telephone trunking problems, whether in equilibrium or 
transient condition, or even nonstationary,” engenders the Erlang 
loss function, B(N, a), which initially arises in the form" 


a if a gt ; 
BWN,a == / dvs, N 20 (integral), a> 0. (7) 


For these reasons and for the purposes of studying certain forms arising 
in queuing theory related to B(N, a) and also for the facilitation of 
numerical evaluation, it is useful to represent the loss function in 
diverse ways. 

The numerical computation of B(N, a) as given in (7) is awkward 
when a and N are large since then both numerator and denominator 
are large. A form well adapted to numerical work is 


ll 


N 

Nani, 

j=0 (8) 
N®=1, N®=N(N—1)---(N—j7+4+1) (j > 0), 


B(N, a)” 


which follows from 


NN! . N N! 
1 = oN) BieN ck eke tia? 
B(N, a) p> 7 ai »y (n AN 


A modified form of (8) is given in Theorem 1. 


N 
j= > Nai, (9) 
j=0 


Theorem 1: 


y—1 
BWW, a) =  N®ai + N@™a’B(N — », a), yzo0 
j=0 
Proof: Since 
NG) = NO(N — »)M, (10) 
one has, from 
v—1 N 
B(N, a)1t = ¥ N@Mai+ Nas, (11) 
j=0 jev 


N N-y N-v 
23 NWaq-i = >: NGtMg-iH = NM’ >: (N — vp) Pars 


j=y F=0 j=0 
N™a-’B(N — »v, a)“. (12) 
The formula of the theorem follows from (11) and (12). 
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Corollary: The case v = 1 implies the known" difference equation 


1 
oe 
aB(N — 1, a) 


BUN, a) = 
1 er 


R. Franks suggested using the value of B,(N, a) defined by 
v—-1 
B,(N,a) =1/ Nai (13) 
7=0 


to approximate B(N, a) in which, for any small number 7 > 0, the 


index vy is chosen so that 
Nor 4, (14) 


Theorem 2 bounds the error of the method. 


Theorem 2: 
B,(N, a)(1 — n) S B(N, a) S B,(N, a). 


Proof: From Theorem 1 one has 


1 — 
BIN n\-l L NO)a~RIN .. w\-l < B,(N, a). 
B,(N, a) + N™a-’B(N — », a) BIN, a) < 3B (N a) (15) 
Thus 
BIN, a) _ 1 


[ae es, a ee aa ae ep wer a LL ee! 16 
B,(N,a) 1+ Na-’B,(N, a)B(N — », a)“ 8) 
Since Na -’ is strictly monotone increasing as a function of N, (8) 
shows that 

B(N + 1, a) < B(N, a) (17) 
for all N = 0; thus 


2c (18) 
B,(N, a) 1+ 7B,(N, a)B(N, a) 
and hence 
BIN, a) 5 
B,(N, a) ~ 
The theorem follows from (15) and (19). 


An integral representation, ascribed to Fortet,!®> may be obtained for 
BUN, a). 


1 — 4. | (19) 


Theorem 8: 
fo a) 
0 


B(N, a) = af e-94(1 + y)Ndy. 
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Proof: From the Eulerian integral 


i * ayldy =TU+ Io, > 1, (20) 
one obtains 
N@ai=a Cc ) if e—*¥yidy. (21) 
Use of (8) now yields 


00 N N . fo) 
B(N, a) = af gy ( : ) vidy = af e°v(1 + y)%dy. (22) 


The integral representation now permits extending B(N, a) into 
the complex plane with respect to both N and a. One defines 


B(z, a) = / ” e-au(1 + y)edy (23) 


in which z, a may both be complex. Clearly, B(z, a)~ is an entire func- 

tion of z for Rea > 0 (Re designates ‘“‘real part’’). The symbols N, a 

will be used for nonnegative integers and positive reals, respectively. 
A generalization of Theorem 1 is given in Theorem 4. 


Theorem 4: 


v—1 
Bz, a)! = SY 2Ma-F + 2Mar-"B(z — v, a)“, Rea > 0. 
j=0 


Proof: Integration by parts of (23). 
It is of interest to investigate the relationship of B(z, a) to the func- 
tion 


¥(Z,a) = e* Te +1) (24) 


which is an extension of the Poisson distribution function, y(JN, a), 
with parameter a. The function ¥(N, a) is a good approximation to 
B(N, a) when a is much less than N. Exact relations between B(z, a) 
and ¥(z, a) are given in Theorems 5 and 6. These relations provide 
convenient means of calculation of B(z, a) for general z, a; e.g., in 
trunk group blocking problems when a nonintegral number of trunks 
is considered. 


Theorem 8: 


BOMBS 2 ere geea) 


S 
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The series converges uniformly everywhere in Rez > — 1, Rea > O. 


Proof: Let u = 1+ y in (28), then 


B(z, a) = ae i. * e-auyzdu; (25) 
hence, 
Bz, a) = act f° e*"u7du — af eX A-Hyzdu, (26) 
and 
BOO BNC. ots f * pau y edu, (27) 


To exhibit the integral in (27) as an inverse factorial series, consider 
the beta function integral 


/ wl — uj du = mony | (28) 


which yields the special case (s 2 O integral) 


1 s! 
Use of the expansion 
ey = YX yy) (30) 
s=0 8: 


in (27) and subsequent use of (29) yield the result of the theorem. 
The Mittag-Leffler expansion for the integral of (27) leads to 
Theorem 6: 
-1 — —1 fod % = s at 
BG, a) = vat +e Y (“I ote gy: 


Conditions of convergence are the same as in Theorem 5. 


Proof: The expansion 
em = Y (-1)1* (31) 
s=0 8: 
used in (27) leads immediately to the required result. 
Whittaker functions,’® W;,,m(z), play a useful role in the discussion 


of B(z, a) and of Poisson-Charlier polynomials to be introduced later. 
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They may be introduced by 


Ww (z) = a ee. i et (1 ame eal eT 32 
Fem! TE —k Fm) Jo Z . - (82) 
Re (} —k +m) > 0, larg z| <m. 
Theorem 7: 


Bz, a) = e*etW 2 19,¢241)/2(@). 
Proof: Let ¢ = ay in (23), then 
Bz, a) = / e-t (1 + ~ at (33) 
0 
The required result follows on comparison with (32). 


A Rodrigues type of relation for B(N, a)-! may be obtained from 
Theorem 3. 





Theorem 8: 
B(N + M, a)" gas = enegat - 
(N + M, a)! = (=1) "ace 5 [e-*a1B(N, a)-4]. 
Proof: From Theorem 3, one has 
eta tB(N, a)! = f° ceo + y) Ny; (34) 
0 


hence, 


M 00 
(— 1)" 55 [eta BW, a)*] = f° eer + y)*Mdy. 5) 
a 0 
The formula for B(N + M, a) now follows on multiplication by ae’. 


Corollary : 
N 


B(N, a) = (—1)%ae* —_ [e-*a ]. 





Additional formulas for B(z, a) (as a function of a) provide con- 
venient means of computation for values of a near some fixed point. 
Two such formulas are given in Theorems 9 and 10. 

Theorem 9: 
_ Net n ee oe 
Bona += (1+) = ( ) Banya) (E). 
a v=0 \P a 
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Proof: The function S,(a) given by 





nr at 
Sila) = paar (36) 
is an Appell polynomial, that is, 
dS» 
a = Sn-1(@). (37) 


Thus the Taylor expansion for S,(@ + #) can be written in the form 


S@t)= > J Sn-e(a)e’ (38) 


One obtains from (7) 
B(n,a) = n!a7"S,(a) (39) 


and hence the theorem follows on substitution into (88). 
Theorem 10: 
= t An aed i eS 
Batt) = aie e > Ble + v, a)’, 


v=0 y! 
Rea > 0, |t| < Rea. 





Proof: Let 
L(z, a) = ea B(z, a)“, (40) 
then, from (23), 


d d . 
eneels — — epter —a(l+y) + z 
ace I(z, a) ] é i é a a y) dy, 


a Gay eeu) (1 + y)* Ody, 
0 


= I(z + 1, a), (41) 
and hence 


£. Ca eon Ce ee (42) 


Thus, by Taylor’s formula, 
a ees — Cee (43) 
y=0 VP! 


Substitution of (40) into (48) yields the required result. One has 


L(z + v, a) —_ pai © e7a(lty) (1 a y)***dy, (44) 
0 
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hence the terms of (43) are O[(¢/Re a)’»®°*]. The stated convergence 
criterion now follows. 


ill. ASYMPTOTIC EXPANSIONS 


Particularly simple and convenient forms for theoretical and numer- 
ical applications may be obtained by examining asymptotic expansions. 


Theorem 11: 


[re] 
Biz, a3 ~ YS 2™Mar’, a—>o, |largal <n. 
v=0 


Proof: The asymptotic expansion for Wy,m(z) is'® 
Wi, m(z) ~ eek 
yf ge S& Lmta —By Tint = 68)". Lm 9 $4)"7 
vel viz : 
z—>o, |largz| <-. (45) 
Substitution of the parameter values given by Theorem 7 establishes 
the result. 

It should be remarked that the error, when using the partial sum 
>a) 2a” to approximate B(z, a)-!, does not exceed |a| |z“a-*| / 
Re a provided Rez S k, Rea > 0. This follows directly from Theorem 
4 and (23). 

For large z, one has 


Theorem 12: 


Bz, a) ~ pz, a) — Xu Celene. 


z—2 ©, |arg z| < 2/2, uniformly in any bounded region of the a-plane 


for which Rea > 0. 


Proof: The representation of Theorem 5 is used. One must show 


as’ 


C2 aCe EEE a) 
=0( app etm) 


Biz, a) in v(z, a) = 


that is, 


. as” 
ete ere: (47) 
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Let Re z = 2, then one has 


as” 


x @Fa Fl €F8 
1 zs 
< BT a eS 
si l"Gratp ery 
Let v = 2 + n andl = s — n, then the dexter of (48) is 
00 ta|? (49) 


2 @ FD) WFD 
Use of (29) and (30) on (49) yields 7 


as” 


, 
Zetat Dy era Self, eermwaw 60) 








thus 
jalelat 


ae jas 


as 


The theorem is proved. 

Useful asymptotic formulas are obtained when both a and z have 
infinite limits but approach infinity in a fixed ratio, that is, a = cz, ¢ 
fixed. The cases c > 1, c = 1 are discussed by A. Descloux" for large 
real z. Theorem 13 generalizes the result for c > 1 to complex z and 
provides the structure of the coefficients for the complete expansion. - 
The case c = 1 is obtained as a corollary to Theorem 14. where the 
result is also generalized to complex z. 


yo, (51) 











Theorem 18: $e a 


Bz, cz)" ~ 2 ge, 
1=0 








Proof :* One has, from (23) -_ 
B22) = cz f° ee24(] ee oe #A BZ) 


“The author wishes to thank C. L. Mallows for this pro which apie a uch 
longer proof originally supplied by the author. 
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Defining the function h(y) by 
h(y) = cy —In (1+ y), (58) 


one may write, since h(0) = 0, h(~) = ~, and h(y) is monotonic 
increasing, 


B(z, cz) = af enzh(y) wea) 1 dh. (54) 


The factor c(1 + y)/[ce(1 + y) — 1] is now expanded in powers of h 
as follows: 


c(1 + y) = h! 
cy) ©) 


A theorem on Abelian asymptotics for Laplace transforms!® and (54), 
(55) yield the asymptotic behavior of B(z, cz)—! for z— ©, |argz| <71/2; 
thus, 


Bz, cz) ~ > giz. (56) 


The coefficients g; may be evaluated as follows. Let 





w=c—Inc, (57) 
and 
é 
k(w) = 7 (58) 
then 
2 eZ _ 0 + y) 
bw +h) = Ke +9) -~ med +H] =a ey. (60) 
Thus, Taylor expansion yields 
see eo 
fc) 1 xa) RCo). oy) 
One has 
d c h6Ud 
dw c—ldc’ ey) 
hence 
d \! c d\' ¢ 
(45) #™ = (4%) rae ae a (62) 
The following formula is obtained directly from Theorem 13. 
i pe Oe OD gee 
Bie?) e—1 (e€—-1)%2 (€—1)52" (63) 


The evaluation and behavior of B(z, a) for a in a neighborhood of z 
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is often of interest; accordingly, the function B(z, z + cyz) will be 
considered for 2 — ©; c is a fixed real number. 


Theorem 14: There exists a representation of the form 


B(z, 2 + cv2) ~ ¥ aj(ce)e- GP, 
j=0 


Tv 
2 0, jargz| <5, c real, 
an which 


ioe] 
ao(c) = ei? i edu, 


c 


ie a Vy Diag 
ai(c) = 3 + 36 3 cao(c), 
wintie? areas 1 5 a 3 { 1 1 1 6 Pa 4 1 
a2(c) i8 ° 36 © 12° (a5 a 12 o(0). 


Proof: From (23), one has 
B(z, 2 + evz)7 = (2 + ceva) i e-(etevz)u(J + u)edu, (64) 
0 


larg z| < > 
Let u = v/ vz, then 
B(z, 2 + cvz)" 


a eG teh (py, z)dv, 
; (65) 
h(v, 2) 


I 


Meas uv \ 
: (1+ 5) Geto. 


Let K be a positive constant, then, for |v| < K, h(v, 2) clearly possesses 
an asymptotic development in yz uniformly in v; thus, 


h(v, 2) ~ ¥ bj(v, e-G-Y2, zz &, (66) 
j=0 


in which the coefficients b;(v, c) are polynomials in v. In particular, 


bo(v, c) = 1, 
bi(v, c) = 50 +, (67) 


be(v, c) = xo — fo + a v8, 
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Since 
e Getter)yke], (0, re) ) (68) 


for each k > 0 and any c, termwise integration of (66) leads to the 
required asymptotic expansion. Thus, letting 


aj(c) = f° e-G+mb;(0, odd, (69) 
0 
one has 
Bz, 2 + eva ~ ¥ aj(c)e- EP, (70) 
j=0 


The formulas for ao(c), a1(c), a@2(c) stated in the theorem are obtained 
by evaluation of (69) using b;(v, c) as given in (67). 


Corollary : 
Bg at~ Je tet fz 
: 2 3 12 N22’ 


T 


zZ—> ©, larg 2| 5 


Proof: The result is obtained from Theorem 14 with c = 0. 

This theorem helps explain the phenomenon of the efficiency of 
large trunk groups since even when a > 2, B(a, a) is small as long as a 
is in a small neighborhood of x; thus, Theorem 14 shows that 
Bia, x + cz) ~ 1/aovz, rx > &. 

Theorem 14 shows that the parameter c may be viewed as a standard- 
ized offered load measuring the deviation of a from z in units of Vz. 
The value of this viewpoint is derived from the very simple approximat- 
ing form for B(z, a); thus, 

Bia, a)! & aov% + ay + met (71) 
An application of this is to the computation of dB(a, a)/dx given in 
(92). Another advantage is the capability of computing B(x, a) by 
means of a single-entry table against the standardized offered load c 
rather than the usual double-entry table against x and a. 

Table I gives the values of ao(c), ai(c), ae(c) for —-8 Sc $4 in 
steps of 0.1 with the intention of covering a practical range of values 
of B(x, a). As an illustration, it is desired to compute B(400, 378). 
Use of (71) with c = — 1.1 gives the result 0.0122 correct to the last 
figure. If c does not appear in the table, then interpolation is used. For 
example, to compute B(400, 377.6) for which c = — 1.12 linear inter- 
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polation in the table of coefficients and use of (71) yields 0.0118 cor- 
rect to the last figure. The method, of course, is valid even when the 
number of trunks is nonintegral. Consider, for example, B(400.34, 
420) for which c = 0.98463. The result obtained by linear interpolation 
in the table is 0.0713 correct to half a unit of the last figure. 

The accuracy deteriorates when «x is decreased or when c is large and 
negative. Thus, for B(10, 8), one obtains 0.12144 as against the correct 
value 0.12166. In this case c = — 0.6325 is not too disadvantageous. 
The case B(10, 5) for which c = — 1.58114 yields a much greater 
error, namely, 0.0256 as against the correct value 0.0184. This error 
occurs, however, for a small trunk group where exact calculation is 
quite feasible. To aid the delineation of suitable regions of (c, x) for 
which the table is accurate, a curve is given in Fig. 1 defining 5-percent 
error. When a computation is made from the table using any point 
(c, x) in the unbounded, unshaded region, the error incurred will be 
less than 5 percent of the true value of B(z, a). 


IV. DERIVATIVES AND INEQUALITIES 
It is desired to obtain formulas for the derivatives of B(z, a), with 
respect to z and a. 


Theorem 18: 


GO) 12-14 Be, «| Bea, Rea > 0. 
Oa a 


Proof: From (23), one has 
—l oe) ce) 
ees =| e—*4(1 + u)*du — af e-**(1 + u)tudu; (72) 
Oa 0 0 
hence, 


OB (z, a) 


= 1 Be a) — Bee + 1,0)" + Ble, a). (73) 
0a a 


Use of Theorem 4 provides the relation 


-1 
dB (z, a) Perel 2 Biz, a)7! — 1 + Bia, a), (74) 
da a 
Since 
-1 
CE OT eRe eee Oo. (75) 
Dix 0a 


the result of the theorem follows from (74). 
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For the purpose of obtaining an approximate formula for the deriva- 
tive with respect to z, consider 


flu) = aB(a, aje(1 + u)? (76) 
in which a > 0, x > 0, and for which, by (23), 


[Oo sau = 1. (77) 


It is convenient to introduce the random variable & with density func- 
tion f(u). The power moments yp, defined by 


by = HE, r > 0 (integral), (78) 
are given in the following theorem. 
Theorem 16: 
ae = Boz, a) & (-1)" (7) B@ +1, 0). 
1=0 


Proof: Define a generating function ¢(t) by 


g(t) = He? = aB(a, a) { e-(-)4(] + u)*du, (79) 
0 
then, since 
(a — )B(z, a — 1) f° e941 + u)sdu = 1, (80) 
0 
one has 
= aB (a, a) 
CO = DB Ga a) St) 
Use of Theorem 10 in (81) provides the expansion 
00 —1 
o(t) = Ba, ae y ae e. (82) 
Since 
oO) = Lae, (83) 


the coefficient of t” in the expansion of (82) in powers of t yields the 
required result. Thus 


= (1) B@ +1, a) 
tc=— BE, ayr! 2 Ge es Ta 





(84) 
and the formula of the theorem follows. 


542 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1974 


Corollary: The central moments a, are given by 

a = E(E— ws)" = Bea) © (-)"'( 7) a + NB] FL a) 
Proof: The same as for Theorem 16 but considering the function 
e—1'&(t) instead of P(t). 


An approximation to 0B(xz, a)/dx may now be obtained. 


Theorem 17: 


= = dB (a, a) = ox 1 a2 
B(x, a)! —— = In (+ wy) (ee aoe 
] a3 1 
Proof: From (23) and (76), one obtains 
Hie ay a Shines (85) 
Since, by use of the mean value formula, 
In (1+ 8) = In (1 + ws) + — (E— ) — 5 (El? 
: PT Gaga ee ee > 


1 1 
30 bm? 7 


one has, from (85) and the corollary to Theorem 16, the required 
result. 


4 mn)? 7 (E—m)', 0<0<1, (86) 


Corollary : 
— B(x, a) Bo) 1m 1 a 


<mO +e) 90 F apt 3 OF a 


Proof: The error term of Theorem 17 is omitted. 

For ready reference the following formulas are given in which 
B= B(a, a), Bi = B(a + 1, a), Bo = Bia + 2,4), Bs = B(x + 3, a), 
By= Biz + 4, a). 


wi = — 1+ BBz', 
ae = (ui + 1)? — 2(u. + 1)BBy* + BBz", (87) 
a3 = — (41 +1)? +3(u1. + 1)BBr' — 3(41 + 1I)BBr’ + BB;", 


2 
] 


(a1 + 1)4 — 4(u1 + 1)3BBrt + 6(u1 + 1)°>BBz* 
. — 4(ui + 1)BB;* + BBs". 
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The evaluation of Br', By!, By1, By? is facilitated by successive use 
of Theorem 4. 

An alternative method of obtaining 0B (a, a)/dz is based on Theorem 
14. Let 
f@,c) = B@,a), a=x2+ ena, (88) 





then, from Theorem 14, 
—1 oo a 
FeO FL Ra (e-vr, 20; (89) 
j=0 
hence, 
Of (x, ¢)7 1 Qo(c 
a = ag {no — |. (90) 


Thus, the computation of df(x, c)/dxz is easily accomplished with the 
help of Table I and the formula 


949 — 56, 92 SEO w — BO faye) — 90}. gy 


One now has 


OB(z,a) df(x,c) dAB(a, a) c 
Ox a da (: * a o) 


A simple upper bound on — B(z, a)dB(a, a)/dx] is given in the 
following theorem. 


Theorem 18: 
—B(a, a)7 a8, @) < In (1 + yy). 


Proof: Since the function — In (1 + u) is convex for u = 0, the re- 
quired inequality follows from Jensen’s inequality, namely, 


g( Et) S Eg(é) (93) 


valid for functions g(x) convex over the range of the random variable 
é, and (85). 

A function g(x) > 0 is said to be log-convex over a set if In g(x) is 
convex over the set. It is known!® that the sum of log-convex functions 
is log-convex and hence that the integral of a log-convex function with 
respect to a parameter is log-convex provided the function is log-convex 
for every value of the parameter. Since a necessary and sufficient 
condition that a twice-differentiable function be convex is the non- 
negativity of its second derivative over the corresponding set, one 
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derives the inequality 
g’g—-g* 20 (94) 


as a necessary and sufficient condition that g > 0 be log-convex. One 
now has 
Theorem 19: B(x, a)—, [aB(az, a)]— are log-convex functions of x and 


of a, respectively, fora > 0 and all x. 


Proof: The results are immediate from (23) and the observations that 
(1 + u)# is log-convex as a function of x for u = 0, and e~** is log- 
convex as a function of a for u = 0. 


Corollary : 
B(a, a) 


aB(z, a) | 22°) 24 oS | | Be, a) + a Ba) oY. 


Proof: Use of (94). 
An immediate application of Theorem 19 is to the logarithmic inter- 
polation of B(z, a), that is, linear interpolation of In B(a, a). 


a’B(a, a) — | AB(a, a) |? 
a Ox , 


Theorem 20: Let a, b, p,q > 0,p + q = 1, then 


B(x, a)?By, a)? S B(px + gy, a), 
[aB (x, a) ]?[bB(a, b)]? S (pa + gb)B(z, pa + qb). 


Proof: Jensen’s inequality applied to —In B(x, a) and —In [aB(z, a), 
respectively. 

An extension of the result of Theorem 20, for the purpose of obtain- 
ing an approximate formula for B(z, a) when x is not an integer, may 
be derived from the corollary to Theorem 16. Let N = [xv], = x — N, 
and a, be the central moments computed for the density function 


f(u) = aB(N, aje*(1 + u)%, (95) 
then one has 
Theorem 21: 
k-1 
Bz, a) = BUN, a) F ("asl + ms)" + BOY, a) (7) et, 
k even, lja| < 


Proof: Let & be the random variable with density function f(u), then 
B(x, a) = BIN, a)E(1 + &)*. (96) 
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Since 


a+or= 2 (1) ature m+ (Z) -ad, 07 


r= 


the result follows from (96) and the corollary to Theorem 16. 
A useful special case of Theorem 21 is 


B'-*B,e 


B(a, a) & — > 
1 Bi 


(98) 





in which 


B=B(N,a), Bi=B(N+1,a), Be=B(N+2,a). (99) 


V. CONCLUSION 


Further investigations would be desirable; for example, one would 
like to know the contour function g(z) for which B[z, g(z) ] is constant. 
Truncation error formulas for the asymptotic expansions of Theorems 
13 and 14 would be useful; also, the general structure of the coefficients 
a;(c) of Theorem 14 should be determined. Asymptotic formulas of 
various types should be obtained for G;(z, a) similar to those given for 
B(z, a). These formulas may then be used to obtain asymptotic results 
for its zeros which are needed in many transient’ and time-variable 
blocking analyses. 
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APPENDIX 


The function B(z, a) is related to the Poisson-Charlier poly- 
nomials*°-” much used in telephone traffic studies. Let 


Yolz, a) = vz, a), 
di 
¥; (2, a) oe doa ¥ % a), (100) 
then the Poisson-Charlier polynomials, G;(z, a), are defined by 


v5 (2, a) = v(z, a)G;(z, a). (101) 
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The Taylor expansion 


CE es Sle a) (102) 


j=0 


yields the generating function” 
: t\) ti 
et(1tly = ¥ Ge, a) 5 (103) 
a j=0 Jj: 
Thus, explicit formulas for G;(z, a) are 


G;(z, a) = a Ds (—1) ( 2 \2 


Jay vl 


= 3 (—1)*”” ( ) ria (*). (104) 
»=0 v v 
The first few polynomials are 
Go(Z, a) = 1, 


Gi (2, a) = = @ — a), 


i (105) 
G2(Z, a) = a [22 — (2a + 1l)z + a*], 
G3 (Zz, a) = 7 Les — 8(a + 1)2 + (8a? + 38a + 2)z2 — a®]. 
A recurrence relation derived from (103) is 
Girl, a) = 2 —L— 26,2, 0) — 1610, 0). (106) 


The polynomials, G;(z, a), possess many properties analogous to 
those of B(z, a). A Rodrigues formula is given in 


Theorem 22: 
qd* 
G44(2, a) = a *e* da* [e~*a?G;(z, a) J. 
Proof: One has from (100) 


Visel2, @) = £2 vile, «), (107) 
and hence 
Crvle, o¥(, a) = £5 We, @)Gs(2, a) I (108) 


The result follows on use of (24). 
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Corollary : 
di 
G;(zZ, a) = a-*e* od [e-*a? ]. 


Proof: 
Go(z, a) = 1. (109) 


An integral representation for G;(z, a) is given in 
Theorem 28: 
yy (eae =. (a , OF 7 — a jg 2—1 
Gs(—2,) = (“1 gy feed + w)ytdy, 
Rea > 0, Rez > 0. 
Proof: From (20), one has 
1 ioe) 
Oy Za —a(l+y)q,2—1 
eax Te) i e Wy2ldy. (110) 


Substitution of (110) into the corollary of Theorem 22 yields the result. 
Theorem 23 permits obtaining a Wittaker function representation. 


Theorem 24: 
Gi (2, a) = (—1)faT FFD PQA WO 1 o4.1y/2, G—2)/2(@). 


Proof: Comparison of Theorem 23 with (32) and replacement of —z 
by z. 

The representation of Theorem 24 remains valid, by analytic con- 

tinuation, even outside the region of convergence of the integral of (32). 


Corollary 1: 
B(N, a) = (—1)%Gy(—1, a). 


Proof: Comparison of Theorems 7 and 24. 
Corollary 2: 
Gy(z, @) = = saz — 1, a) — Gale, a). 


Proof: Substitution of the representation of Theorem 24 into the re- 
currence relation” 


Wrm(2) = VzWe-y.m—y(2) + F —k + m)Wi-am(2) (111) 


yields the result. 
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Corollary 3: 
G;(2, a) = Gi(z — 1,0) +2 Ga — 1,2). 
Proof: Same as for Corollary 2, except the following recurrence rela- 
tion is used: 
Wim(2) = VeWi-y.mig(2) + (2 — & ~ m)Weam(2). (112) 
A representation of G;(z, a) in terms of B(z, a) is given in 
Theorem 25: 
ee ee ey ce ae a ee 
ONESIES AEN). ay ae ce) 


Proof: From (23), 
1 a be — aL N 
aB(N, a) =f € v(1 =F y) dy, (113) 
and Theorem 23, 


Gw(— ja) = (I Gy [ema + yay, (114) 


one has the result on use of 


j~-l oe) (a) 
oi fend + a) May = (1) [er + y)%yrtdy. (018) 


dat 


The Poisson-Charlier polynomials possess addition formulas similar 
to those of B(z, a)— as given in Theorems 9 and 10. 


Theorem 26: 
t\-i 2 J t \’ 
Geato=(1+5) (J )a.@a(-(E): 
Qa v=0 v Qa 
Proof: Use of (103) shows that the system of functions 
~1)i] . 
( 7 | IG; (Z, a) 





. 


has the generating function e*'(1 — t)*, and hence*® form an Appell 
system with respect to a, thus, 





ta SP itil a) | . a oF NG ;1(2, a). (116) 


The Taylor expansion of [(—1)#/7!](a + t)'G;(z, a, + ¢) in powers of 
é now yields the required result. 
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Theorem 27: 
Gat =(14b)"e ¥ Gale a 4 
Qa vy =0 v 


The series 1s permanently convergent. 


Proof: Use of (107) and Taylor’s expansion yields 
Cy t’ : 
¥iZ,a+t) = ol Vitv(Z, a). (117) 
The result is now obtained from (101) and (117). Since, from (104), 
z\3 : 
Giea~(Z), joe, (118) 


the convergence is permanent. 
An asymptotic expansion is given by 


Theorem 28: 
Gs(2, 0) ~ pf +E (-p 


i 





|, a—> 0, 


Proof: The result is obtained directly from (104). It may also be ob- 
tained from Theorem 24 and (45). 
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